Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5133 |
Symbol | |
ID | 5737091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 183127 |
End bp | 186303 |
Gene Length | 3177 bp |
Protein Length | 1058 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641282298 |
Product | hypothetical protein |
Protein accession | YP_001547889 |
Protein GI | 159901643 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGATT TGTATATCCC TGATGAACTG AGTGATAGTT TTCATGCAGC CATCCAAGCT CGAAGCCAAC TGAGTATAGA TATAGCACAG CAAACGTTTA CTTATGCGAT TCACCAGTGG GAATCAATCC TTCAGATTTG TGCGAGGAAA GGCTATTCTG AACTGGGTGC AATTGCCCAT AGTGAGATAG GACTTATTTT AGGGCATCGT TATCGAATAT ATGGTGATGA TAACGATCTT CATCATGCAA CGAGGCTGCT AACACAATAT ATTCATTCAG TCCCTGATAC CTATATCGAA AAGCCAAGGA TTTGTAATGG CTATGGCACC ATTTATCGCA ATCTCTATGA AGAAACTGGC CAAATTGAAT ATCTCAACCA AGCGATCGCG ACGTTTGAAC ACTTTATTGG CAATACTCGT CTGCATCCGT TGCACATAAG CGTTCTTCAC ACCGGATACG CGAACGTCTT ACTTTTTCGT TTTGATATCT TTGATGATAT CCAGGATGTG TTTCAAGCCC TCGCTATTCA AAAAAAAGCG TTGGAAGCCT GTGAACCGCG ATCACAACGA TGGGTCACTA CCAACGCCAT GCTCGCGAAC TCTCTCTTAA GGCTCGCCAA ACGCGAAAAA AAGGCTGTGT TTCTTGATGA AGGGATATTT TATGCAACCG AAGCCTTAGC GTACATTGAT CCATCGAACC CTCACTGGTT TAACTGCAAT AACAATCTTG GACTAGCCTA CAGTTTCAGA TTTGAGGTAT CCAATCATAT TACGGATATC ACTACTGCTA TCCATTATTA CCATACAGCC TTGCAGGCAC AGGCTATCTC GCCTCAAAAC ACTGGCTTAG TGTGGAATAA TATTAGTGTG GCCTATCGAA CAAAATTCGA GACCTGGGGC GACATCAGCG ATATTGACGC TGCGATTAGT GCATTACACC GTGCGCTCGG CGTGACGGCG GCACCCGCCC CCCTGTGGAT TATGTGCAAA CATAATTTAG CAGCCAGCCT CATCCGTCGG CATGAAATAC GGAAGCATCC GGTTGATATT CAGAAAGCTC TTTCGATCGT TACGGATGTC CTCGGAATTA TGCCAGACTC GCTAGCAGGG AAAAGCGATT TCTATAATCT CGAAGCAAGC ATCTATCACA CACAATATGA GCAAACGACA GATATTGCAG ATATTCGTCG GGCCACAACT GCCGCGCAAG CAGGCCTCAA GCTGCCCAAT CCCGCGAATG AGTTATGGTG CATCTATGGA CGCATATTGC TCAGTCGGTT TAAGCATGAG CAGCGGCCCG AAACCCTTGA AGAGGCCATT CGGATCAGTC GGGAGGATGT TGCGAGGGGT CTCCCTCATA CCCACGGTTG GGCACGCAGT TGTGATGTCC TGTGTTCAGC TTTATTTAGT AGATTTAAAA TGGTTGGCAG TTCGAACGAT GCGGACTATC ATGAGCTGCT TAATCGCTAT GGAGCATTGC TTAACTACCC AGGATTACCC TTGCACCATC GCTTGATCGT ATGCGGTAAT CTCGGGTATC TTCATATCGT CAAAAATAAG TGGCGTGAAT CCTGTGATAC CCTCCTGCAG GGTATTGAGG TTGCGGACAC CCTCTATCTG ACCCAGGCAA CAACCATCAA TCGTGAGCTA TGGAGCGCCA CCGCTGGTAA TATCTATCGT CGTGCTGCGT ATGCTTTAGC GAACCTAGGC CGAATTGATG AAGCGGTCGT GATTCTTGAG CGTGGACGGT CAAAGATCTT GGGTGATCAA CTCCAGCGTG AATCAGAGGA AGTTGCGTCC TTAGAACGCG ATCATCCACA CCTCTATCAA GACTATATAG CAACCTCAGC CCGCTTACGC AGAGTCGCCA ATCAAGAATG GGTATCCCGC CTCTATCGCG ATCATGAGAT GAACACCTAT GATGAAGCCC GAGAAGCCCA AACGACGTTT CAATCCATGC TTCGCACTAT CCGAGCGCTG CCGGGCTATG AATCATTTCT AGATACATTT TCCTATACCG ATATTATTGA GTGTCTCCAG CCAGGTATGG CACTCGTCTA TATTGATGCA ACGATTGACT TCATGTATAC GATTGTCATC GCCCGTTCCG ACCAGTCGTT TGATCTCCAT TATAGTGAAC TGCGAAATTT TTCTATTCCA AAGTTAAAAA CATTACTGAT GAATCAAGAA GAAGAAGGCA TCTATGGTAG TTTTATGCGT GGTCAACTCG AAAATCCTCG CGCTTTTTTG GGCCACTTAA GCGGTATTTT AGACGAGCTT GGCGAGAATC TCATTAGTCC TATTGCTGCG TATCTGCACA CACAATACAT GACCGAGGTG GTGCTCATTC CCGTTTTCCT CCTCAGGCCA CTTCCAGTAC ATGCTGCCCG TTATAATGGA ACCTATTTTC AGGATGATTT TACCATTTCC TATAGTCCTT CTGCGCGGAT TTTCGCTATC GCCAGTCGGC TCCAAGGTCG CCATGTCCAA CCACTCATTG CGATAGGAAA TCCGACCGGA CAAGCGGGTT CAGCACTCTA TACCGATTGG TTAGCCGAGG AGTTTCAACG GATCGCTGGG GGCGGAGAGG TTCTCTTACA TCACCATGCA ACCCTCCAGA ATGTTCTATC GGCCATAGGT GAGCGAACCC CACGGCATAT CTTATTTGGG TGTCATGGAT GGTATGATGG TGATGAGCCA CTCAAGTCCC ATCTGGTGCT GGCCAATACG AATCTGACCT TGACGGATGT AATGGCGAAT CTTGACTTAG CGAAGACAGA TATGGTTATT TTAGTTTCCT GTAAAATGGG GGTCTTGGAT TTTAAGCGCC TCAGTGAAGA AGTGCTTAAC TTTCCAATTG GCCTCTTATA TGCAGGGTGC AAAACCGCAC TTGCTCCACT CTGGGCGGTG TATGCATTAC CGACAGTGTT GTTGCTTCAC CAGATGTATG CGTGGATGAT AGCCGGCAGT TCATCCGCGA AGGCGCTCAG CGACGCAACA CGCTGGCTGC GTACTCTTTC CCGTGCTGAG GCACTCCACG CTGTTGCGAT GCTCGTTCCC TATGAAACGC AAGCGAGAAC CGCAGAGGAG ATGCTTCGTC CATTTCGGGG TGATCAGCCG TTCGCAAATC CTGTGTATTG GGCAGCCTTT ACCCATTATG GCGCAGTGCT CAAATAA
|
Protein sequence | MNDLYIPDEL SDSFHAAIQA RSQLSIDIAQ QTFTYAIHQW ESILQICARK GYSELGAIAH SEIGLILGHR YRIYGDDNDL HHATRLLTQY IHSVPDTYIE KPRICNGYGT IYRNLYEETG QIEYLNQAIA TFEHFIGNTR LHPLHISVLH TGYANVLLFR FDIFDDIQDV FQALAIQKKA LEACEPRSQR WVTTNAMLAN SLLRLAKREK KAVFLDEGIF YATEALAYID PSNPHWFNCN NNLGLAYSFR FEVSNHITDI TTAIHYYHTA LQAQAISPQN TGLVWNNISV AYRTKFETWG DISDIDAAIS ALHRALGVTA APAPLWIMCK HNLAASLIRR HEIRKHPVDI QKALSIVTDV LGIMPDSLAG KSDFYNLEAS IYHTQYEQTT DIADIRRATT AAQAGLKLPN PANELWCIYG RILLSRFKHE QRPETLEEAI RISREDVARG LPHTHGWARS CDVLCSALFS RFKMVGSSND ADYHELLNRY GALLNYPGLP LHHRLIVCGN LGYLHIVKNK WRESCDTLLQ GIEVADTLYL TQATTINREL WSATAGNIYR RAAYALANLG RIDEAVVILE RGRSKILGDQ LQRESEEVAS LERDHPHLYQ DYIATSARLR RVANQEWVSR LYRDHEMNTY DEAREAQTTF QSMLRTIRAL PGYESFLDTF SYTDIIECLQ PGMALVYIDA TIDFMYTIVI ARSDQSFDLH YSELRNFSIP KLKTLLMNQE EEGIYGSFMR GQLENPRAFL GHLSGILDEL GENLISPIAA YLHTQYMTEV VLIPVFLLRP LPVHAARYNG TYFQDDFTIS YSPSARIFAI ASRLQGRHVQ PLIAIGNPTG QAGSALYTDW LAEEFQRIAG GGEVLLHHHA TLQNVLSAIG ERTPRHILFG CHGWYDGDEP LKSHLVLANT NLTLTDVMAN LDLAKTDMVI LVSCKMGVLD FKRLSEEVLN FPIGLLYAGC KTALAPLWAV YALPTVLLLH QMYAWMIAGS SSAKALSDAT RWLRTLSRAE ALHAVAMLVP YETQARTAEE MLRPFRGDQP FANPVYWAAF THYGAVLK
|
| |