Gene Haur_5133 details

Gene Information

Locus tag: Haur_5133
Symbol:
ID: 5737091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 183127
End bp: 186303
Gene Length: 3177 bp
Protein Length: 1058 aa
Translation table: 11
GC content: 47%
IMG OID: 641282298
Product: hypothetical protein
Protein accession: YP_001547889
Protein GI: 159901643
COG category: [S] Function unknown
COG ID: [COG4995] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
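
The start, end, gene length, and protein length fields listed above are related by simple arithmetic. A minimal Python sketch of that consistency check follows. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; both are assumptions about this record's conventions, not something the record states. The variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 183127
end_bp = 186303
protein_length_aa = 1058

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons (remainder 0).
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein.
expected_protein_length = codons - 1

print(gene_length_bp, remainder, expected_protein_length)
```

Under those assumptions the computed values agree with the listed gene length of 3177 bp and protein length of 1058 aa.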


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGATT TGTATATCCC TGATGAACTG AGTGATAGTT TTCATGCAGC CATCCAAGCT 
CGAAGCCAAC TGAGTATAGA TATAGCACAG CAAACGTTTA CTTATGCGAT TCACCAGTGG
GAATCAATCC TTCAGATTTG TGCGAGGAAA GGCTATTCTG AACTGGGTGC AATTGCCCAT
AGTGAGATAG GACTTATTTT AGGGCATCGT TATCGAATAT ATGGTGATGA TAACGATCTT
CATCATGCAA CGAGGCTGCT AACACAATAT ATTCATTCAG TCCCTGATAC CTATATCGAA
AAGCCAAGGA TTTGTAATGG CTATGGCACC ATTTATCGCA ATCTCTATGA AGAAACTGGC
CAAATTGAAT ATCTCAACCA AGCGATCGCG ACGTTTGAAC ACTTTATTGG CAATACTCGT
CTGCATCCGT TGCACATAAG CGTTCTTCAC ACCGGATACG CGAACGTCTT ACTTTTTCGT
TTTGATATCT TTGATGATAT CCAGGATGTG TTTCAAGCCC TCGCTATTCA AAAAAAAGCG
TTGGAAGCCT GTGAACCGCG ATCACAACGA TGGGTCACTA CCAACGCCAT GCTCGCGAAC
TCTCTCTTAA GGCTCGCCAA ACGCGAAAAA AAGGCTGTGT TTCTTGATGA AGGGATATTT
TATGCAACCG AAGCCTTAGC GTACATTGAT CCATCGAACC CTCACTGGTT TAACTGCAAT
AACAATCTTG GACTAGCCTA CAGTTTCAGA TTTGAGGTAT CCAATCATAT TACGGATATC
ACTACTGCTA TCCATTATTA CCATACAGCC TTGCAGGCAC AGGCTATCTC GCCTCAAAAC
ACTGGCTTAG TGTGGAATAA TATTAGTGTG GCCTATCGAA CAAAATTCGA GACCTGGGGC
GACATCAGCG ATATTGACGC TGCGATTAGT GCATTACACC GTGCGCTCGG CGTGACGGCG
GCACCCGCCC CCCTGTGGAT TATGTGCAAA CATAATTTAG CAGCCAGCCT CATCCGTCGG
CATGAAATAC GGAAGCATCC GGTTGATATT CAGAAAGCTC TTTCGATCGT TACGGATGTC
CTCGGAATTA TGCCAGACTC GCTAGCAGGG AAAAGCGATT TCTATAATCT CGAAGCAAGC
ATCTATCACA CACAATATGA GCAAACGACA GATATTGCAG ATATTCGTCG GGCCACAACT
GCCGCGCAAG CAGGCCTCAA GCTGCCCAAT CCCGCGAATG AGTTATGGTG CATCTATGGA
CGCATATTGC TCAGTCGGTT TAAGCATGAG CAGCGGCCCG AAACCCTTGA AGAGGCCATT
CGGATCAGTC GGGAGGATGT TGCGAGGGGT CTCCCTCATA CCCACGGTTG GGCACGCAGT
TGTGATGTCC TGTGTTCAGC TTTATTTAGT AGATTTAAAA TGGTTGGCAG TTCGAACGAT
GCGGACTATC ATGAGCTGCT TAATCGCTAT GGAGCATTGC TTAACTACCC AGGATTACCC
TTGCACCATC GCTTGATCGT ATGCGGTAAT CTCGGGTATC TTCATATCGT CAAAAATAAG
TGGCGTGAAT CCTGTGATAC CCTCCTGCAG GGTATTGAGG TTGCGGACAC CCTCTATCTG
ACCCAGGCAA CAACCATCAA TCGTGAGCTA TGGAGCGCCA CCGCTGGTAA TATCTATCGT
CGTGCTGCGT ATGCTTTAGC GAACCTAGGC CGAATTGATG AAGCGGTCGT GATTCTTGAG
CGTGGACGGT CAAAGATCTT GGGTGATCAA CTCCAGCGTG AATCAGAGGA AGTTGCGTCC
TTAGAACGCG ATCATCCACA CCTCTATCAA GACTATATAG CAACCTCAGC CCGCTTACGC
AGAGTCGCCA ATCAAGAATG GGTATCCCGC CTCTATCGCG ATCATGAGAT GAACACCTAT
GATGAAGCCC GAGAAGCCCA AACGACGTTT CAATCCATGC TTCGCACTAT CCGAGCGCTG
CCGGGCTATG AATCATTTCT AGATACATTT TCCTATACCG ATATTATTGA GTGTCTCCAG
CCAGGTATGG CACTCGTCTA TATTGATGCA ACGATTGACT TCATGTATAC GATTGTCATC
GCCCGTTCCG ACCAGTCGTT TGATCTCCAT TATAGTGAAC TGCGAAATTT TTCTATTCCA
AAGTTAAAAA CATTACTGAT GAATCAAGAA GAAGAAGGCA TCTATGGTAG TTTTATGCGT
GGTCAACTCG AAAATCCTCG CGCTTTTTTG GGCCACTTAA GCGGTATTTT AGACGAGCTT
GGCGAGAATC TCATTAGTCC TATTGCTGCG TATCTGCACA CACAATACAT GACCGAGGTG
GTGCTCATTC CCGTTTTCCT CCTCAGGCCA CTTCCAGTAC ATGCTGCCCG TTATAATGGA
ACCTATTTTC AGGATGATTT TACCATTTCC TATAGTCCTT CTGCGCGGAT TTTCGCTATC
GCCAGTCGGC TCCAAGGTCG CCATGTCCAA CCACTCATTG CGATAGGAAA TCCGACCGGA
CAAGCGGGTT CAGCACTCTA TACCGATTGG TTAGCCGAGG AGTTTCAACG GATCGCTGGG
GGCGGAGAGG TTCTCTTACA TCACCATGCA ACCCTCCAGA ATGTTCTATC GGCCATAGGT
GAGCGAACCC CACGGCATAT CTTATTTGGG TGTCATGGAT GGTATGATGG TGATGAGCCA
CTCAAGTCCC ATCTGGTGCT GGCCAATACG AATCTGACCT TGACGGATGT AATGGCGAAT
CTTGACTTAG CGAAGACAGA TATGGTTATT TTAGTTTCCT GTAAAATGGG GGTCTTGGAT
TTTAAGCGCC TCAGTGAAGA AGTGCTTAAC TTTCCAATTG GCCTCTTATA TGCAGGGTGC
AAAACCGCAC TTGCTCCACT CTGGGCGGTG TATGCATTAC CGACAGTGTT GTTGCTTCAC
CAGATGTATG CGTGGATGAT AGCCGGCAGT TCATCCGCGA AGGCGCTCAG CGACGCAACA
CGCTGGCTGC GTACTCTTTC CCGTGCTGAG GCACTCCACG CTGTTGCGAT GCTCGTTCCC
TATGAAACGC AAGCGAGAAC CGCAGAGGAG ATGCTTCGTC CATTTCGGGG TGATCAGCCG
TTCGCAAATC CTGTGTATTG GGCAGCCTTT ACCCATTATG GCGCAGTGCT CAAATAA
 
Protein sequence
MNDLYIPDEL SDSFHAAIQA RSQLSIDIAQ QTFTYAIHQW ESILQICARK GYSELGAIAH 
SEIGLILGHR YRIYGDDNDL HHATRLLTQY IHSVPDTYIE KPRICNGYGT IYRNLYEETG
QIEYLNQAIA TFEHFIGNTR LHPLHISVLH TGYANVLLFR FDIFDDIQDV FQALAIQKKA
LEACEPRSQR WVTTNAMLAN SLLRLAKREK KAVFLDEGIF YATEALAYID PSNPHWFNCN
NNLGLAYSFR FEVSNHITDI TTAIHYYHTA LQAQAISPQN TGLVWNNISV AYRTKFETWG
DISDIDAAIS ALHRALGVTA APAPLWIMCK HNLAASLIRR HEIRKHPVDI QKALSIVTDV
LGIMPDSLAG KSDFYNLEAS IYHTQYEQTT DIADIRRATT AAQAGLKLPN PANELWCIYG
RILLSRFKHE QRPETLEEAI RISREDVARG LPHTHGWARS CDVLCSALFS RFKMVGSSND
ADYHELLNRY GALLNYPGLP LHHRLIVCGN LGYLHIVKNK WRESCDTLLQ GIEVADTLYL
TQATTINREL WSATAGNIYR RAAYALANLG RIDEAVVILE RGRSKILGDQ LQRESEEVAS
LERDHPHLYQ DYIATSARLR RVANQEWVSR LYRDHEMNTY DEAREAQTTF QSMLRTIRAL
PGYESFLDTF SYTDIIECLQ PGMALVYIDA TIDFMYTIVI ARSDQSFDLH YSELRNFSIP
KLKTLLMNQE EEGIYGSFMR GQLENPRAFL GHLSGILDEL GENLISPIAA YLHTQYMTEV
VLIPVFLLRP LPVHAARYNG TYFQDDFTIS YSPSARIFAI ASRLQGRHVQ PLIAIGNPTG
QAGSALYTDW LAEEFQRIAG GGEVLLHHHA TLQNVLSAIG ERTPRHILFG CHGWYDGDEP
LKSHLVLANT NLTLTDVMAN LDLAKTDMVI LVSCKMGVLD FKRLSEEVLN FPIGLLYAGC
KTALAPLWAV YALPTVLLLH QMYAWMIAGS SSAKALSDAT RWLRTLSRAE ALHAVAMLVP
YETQARTAEE MLRPFRGDQP FANPVYWAAF THYGAVLK
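
The gene sequence is printed in blocks of 10 bases, and the record lists translation table 11. The Python sketch below is illustrative only and is not the database's own tooling. It strips the block spacing, translates a fragment, and computes a GC fraction. The helper names `clean_sequence`, `translate`, and `gc_fraction` are hypothetical. The codon assignments are those of the NCBI standard code, which table 11 shares for elongation; the sketch ignores table 11's alternative start codons.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as printed in this record.
fragment = clean_sequence("ATGAATGATT TGTATATCCC TGATGAACTG")
print(translate(fragment))
```

Applied to the first 30 bases of the gene sequence, `translate` yields `MNDLYIPDEL`, the first ten residues of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the listed GC content.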