Gene Information |
Locus tag | Haur_1984 |
Symbol | |
ID | 5733873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2438817 |
End bp | 2441696 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279128 |
Product | hypothetical protein |
Protein accession | YP_001544755 |
Protein GI | 159898508 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
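The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the record does not state either convention explicitly, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2438817
end_bp = 2441696


def gene_length(start: int, end: int) -> int:
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, ASSUMING one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2880 959
```

Under these assumptions the computed values agree with the reported Gene Length (2880 bp) and Protein Length (959 aa).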
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGGCGA CCCCGCTACC AATGTTTGCT CGAATCATCC GTTTGGTCGA TCAGCGGCCA TTAAGTGCCC TCGTGATCGC GCATCGGCGG CTGCTCGCGA TGGAATCAAG CGCGTGTGTA GACCATGGTT GGGCTTGGTT TTGTTATGGC TGGGCCGCGC TGCATGGCGA AAAAGTTTCC GAAGGCCTAG CCGCGTTGCA ACAAGCGCAA GCTCTTTTCG CCCAACATCA TGATCAAGCG GGCATGTGGG ATTGTCGTCA AGCGTTGCTG GTTGGTCGAT GGCTACAAGG AGAAGGGGTT TCGTTGCAAC AGGCATGGCA ACCAGTCATC GAAGCCCATT TGCGACTTGG TGGTGCTCGC GCCGCTGCCG AGGCCCAAAT TTATCAATTG ATTCATCTTA ATTACCTTAA ACGCTATCAA GCTGTCCTTG ATCTGGCTGC CACAATCGCG CCGCATCTGA CCACTGCTCC GCAATTCGTG GCTGGTCGGT TTATGCGTAT TGTGGCGATT GCCAATGCAG GCCTTGGCGC TTTTACCAAA GCCAAGCAGG GTTTAGATCA GGCATTGTAC GCTTCTCAAC AAGCCAAGGC GTGGGTTGAT GTAGCTAAAT GCCTTCGTGA GCGGGGCTTT ATTGCTGATC GTCAAGAACA CTATGCAGAT GCCGTAGCCG ACCTTCAACA GGCGATGACC TGGTTTAATC GTTTGGGCAT GCCACTATAT GCGGCCTTAT GCCAGCGAGC ACTTGGCTTG GCACTAAGCA GAATTGGTCA CTACGATCAG GGTTTGCGGT TTAACCTTGC AGCGCGTAGC AGCTTTCGTA TGCTTGATCG GCCCGATTTG GCGGCTGGAT GCGACCAAAA TATTGGCGTG ATTGCCCATT ATGTGCGGCT GCCGCTCATT GCTCAATCGG CCTATCAGCG AGCGTTGGCG GTCTATCAAG CGCGTTCGAG TACCTATGAT AGTTGTGTGC TTCAGCGCAA TTTTGCCCTG TTACAGATTA ATCAAGGCAA TGGTCAGCTT GCGCTCGACT TGCTCGGCGC GATTCAACCT TTAGTGCTAG CCTTAGATGA TCAGTTAGAG CTTGGCGAGT TTTACGAGGC ACTTGCACAG GCGTGGCACT GCCTTGGTGC TGTTGACCAA GCTCAGCGCT TTTTCGATCA GGCAATTGTT TGTTTTGAGG CGATTGGCAA TCAGATTAAT ATTGCCAAAT GTCAGCTTGG TCAAGCTTGG TTGATGCTTG AAACAGGCAA TTGGCAGCTA GCACAGGCTT TGTTAGCGCA AGCTCAGGTG TGGTTGCTTG AACATCCGAC CCATCGCTGG CGTTGTCATT ATGGCTTAGG GTATTGTGCT GCACAGGCTG GGGCTAGCGG GCGGGCCATG GCAGAATATA TCGCTGCTTG TGGCATTGTT GCCCAACTTC GTCAAGCTTT GAGTAGCGAA CATGCTTCGA GTGCGATTTT TGCCCAAGCG CAGCAACTGT ATCACGATAC CATTCGATTG GCGCTTGCCC AGGCCAATAG TAACTTGGCA TGGCAATTAA TCGAGCAACA ACGAGCATTG GTGTTAAATC GCCAAATGCG TTGTTTACCA CTAGCGTTTG ACCCAGACTT GGCAGAGGAA GATCAGCGCT ATCATGCCCG TTTGAGTAGC TTAACGCAGC CAACTGCTGG TCATGAGCAC GTAGAGGCAC TATTCGCCGA TTATATCAAT TTCTTAATTC AAGCGCGACA TACCCTTGAA GGTTCGCTTG CCGATTTGGC GATTGATCGG CCACTTGAGG TTGTTTGTGG CGAACTTGAT 
GAAGCCTTTG ATGGGGATTG GACATGGCTC GGCTACAGTC AATTAGGCGA CGATCTGCTG ATTATCACGC TCTATGCTGG GCAGATTACG GTTATTCGCC AGCCAATTGA TCGGCGTTTC TGTGAGTTAT TAGGCTTGGC CAACTCGTTT ACTGACCATT CAATTTTGTA TGAGGATTGG TCGTTTCTGA GCCAAGCTGC TCCTTTCGCC GAGCTTCGGG CCTTATCTGA TCGACTTATC CCTAGCGTCG TTAAGCAACG CTTGCATCAA AACCATCGCC TATTAATCAC ACCATGTACC AAGCTGCATC AGGTTGCTTG GGCAGCTTTG CTGGTCAACC AACAGCGACT TTGCCAAACC TGTATTCCAC AGATTATTCC ATCGTTGGGC ACATGGTCAT GGTTGCAGGC ACGTCAAAGC TTGGGCACTG AGGCGTTATT GCTGGGCTGT GACAATTTTG GCGAGCGAGC GGACGCGTTG CCACACATTC AAGCAGAACT GCGGGTTGTT GCGCAACAAG TAACCATCCC CGTCAGCACG CTGTTTGGAG CCGAAGCAAC AGGCGTGGCC GTGTTGAAGC TAGGTCAAGC TGGGTTATTA CAACGCTTTC GGCATATTCA TATTGCAACC CATGCGCAAT TAATCGCTGC CCGTGGGTTA CTTGCCCATA TTAAGCTTGT GGATGGCGAT ATGTTTTACA ACGATATCCT CAATTTACGG CTTGCTGGGG CGACGGTGGT GTTATCGACA TGTGATGGCT CGCTGAGTGA AACGTTGCTT GGCGAAGAAG TGCTAAGTTT GAGCCGCGCT TTTTTGGCTG GTGGCGCACG TGAGGTGCTA GCCAATGGGT GGAAAACTAG CGATAGCGGG GTGGTTGAGT TGATGCGGTT ATTTTATCAC TATTTAGCCT ACCCAAATGA TGGGGCAACG GCTTTGGCAA TGGCCCAACG CACATTACTT GAATCTGACG ATCCCAGCCA AGCTGCAGTC TTGGTCTGGG GTGGCTTTCA GGTTGTTGGG GCTGGAACAC TGGCGCAATG GCCATCTGCG CAGATTCCGT CGATCAGCGT CGGTGATTAA
|
Protein sequence | MLATPLPMFA RIIRLVDQRP LSALVIAHRR LLAMESSACV DHGWAWFCYG WAALHGEKVS EGLAALQQAQ ALFAQHHDQA GMWDCRQALL VGRWLQGEGV SLQQAWQPVI EAHLRLGGAR AAAEAQIYQL IHLNYLKRYQ AVLDLAATIA PHLTTAPQFV AGRFMRIVAI ANAGLGAFTK AKQGLDQALY ASQQAKAWVD VAKCLRERGF IADRQEHYAD AVADLQQAMT WFNRLGMPLY AALCQRALGL ALSRIGHYDQ GLRFNLAARS SFRMLDRPDL AAGCDQNIGV IAHYVRLPLI AQSAYQRALA VYQARSSTYD SCVLQRNFAL LQINQGNGQL ALDLLGAIQP LVLALDDQLE LGEFYEALAQ AWHCLGAVDQ AQRFFDQAIV CFEAIGNQIN IAKCQLGQAW LMLETGNWQL AQALLAQAQV WLLEHPTHRW RCHYGLGYCA AQAGASGRAM AEYIAACGIV AQLRQALSSE HASSAIFAQA QQLYHDTIRL ALAQANSNLA WQLIEQQRAL VLNRQMRCLP LAFDPDLAEE DQRYHARLSS LTQPTAGHEH VEALFADYIN FLIQARHTLE GSLADLAIDR PLEVVCGELD EAFDGDWTWL GYSQLGDDLL IITLYAGQIT VIRQPIDRRF CELLGLANSF TDHSILYEDW SFLSQAAPFA ELRALSDRLI PSVVKQRLHQ NHRLLITPCT KLHQVAWAAL LVNQQRLCQT CIPQIIPSLG TWSWLQARQS LGTEALLLGC DNFGERADAL PHIQAELRVV AQQVTIPVST LFGAEATGVA VLKLGQAGLL QRFRHIHIAT HAQLIAARGL LAHIKLVDGD MFYNDILNLR LAGATVVLST CDGSLSETLL GEEVLSLSRA FLAGGAREVL ANGWKTSDSG VVELMRLFYH YLAYPNDGAT ALAMAQRTLL ESDDPSQAAV LVWGGFQVVG AGTLAQWPSA QIPSISVGD
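Both sequences are displayed in space-separated blocks of 10 characters, so the whitespace has to be removed before the text is used in any analysis. The following is a minimal sketch of that clean-up with two basic sanity checks; the helper names are hypothetical, and the short strings are only the first three and last two blocks of the gene sequence above, not the full 2880 bp.

```python
# Stop codons of translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize(seq_text: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a normalized nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


# Excerpts only: the first three and last two blocks of the gene sequence.
head = normalize("ATGTTGGCGA CCCCGCTACC AATGTTTGCT")
tail = normalize("CGATCAGCGT CGGTGATTAA")

print(len(head), head.startswith("ATG"), ends_with_stop(tail))
print(round(gc_fraction(head), 3))
```

Applying `gc_fraction` to the full normalized gene sequence gives a value that can be compared with the GC content field; the number printed here describes only the 30-base excerpt.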