Gene Haur_1984 details


Gene Information

Locus tag: Haur_1984
Symbol:
ID: 5733873
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2438817
End bp: 2441696
Gene Length: 2880 bp
Protein Length: 959 aa
Translation table: 11
GC content: 51%
IMG OID: 641279128
Product: hypothetical protein
Protein accession: YP_001544755
Protein GI: 159898508
COG category: [S] Function unknown
COG ID: [COG4995] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
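
The Start bp, End bp, Gene Length and Protein Length fields are mutually consistent, which the following minimal Python sketch checks. It assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; neither assumption is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2438817
END_BP = 2441696
GENE_LENGTH_BP = 2880
PROTEIN_LENGTH_AA = 959


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, ASSUMING the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions, 2441696 - 2438817 + 1 gives 2880 bp, and 2880 / 3 = 960 codons, of which one is the stop codon, leaving 959 amino acids.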


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGGCGA CCCCGCTACC AATGTTTGCT CGAATCATCC GTTTGGTCGA TCAGCGGCCA 
TTAAGTGCCC TCGTGATCGC GCATCGGCGG CTGCTCGCGA TGGAATCAAG CGCGTGTGTA
GACCATGGTT GGGCTTGGTT TTGTTATGGC TGGGCCGCGC TGCATGGCGA AAAAGTTTCC
GAAGGCCTAG CCGCGTTGCA ACAAGCGCAA GCTCTTTTCG CCCAACATCA TGATCAAGCG
GGCATGTGGG ATTGTCGTCA AGCGTTGCTG GTTGGTCGAT GGCTACAAGG AGAAGGGGTT
TCGTTGCAAC AGGCATGGCA ACCAGTCATC GAAGCCCATT TGCGACTTGG TGGTGCTCGC
GCCGCTGCCG AGGCCCAAAT TTATCAATTG ATTCATCTTA ATTACCTTAA ACGCTATCAA
GCTGTCCTTG ATCTGGCTGC CACAATCGCG CCGCATCTGA CCACTGCTCC GCAATTCGTG
GCTGGTCGGT TTATGCGTAT TGTGGCGATT GCCAATGCAG GCCTTGGCGC TTTTACCAAA
GCCAAGCAGG GTTTAGATCA GGCATTGTAC GCTTCTCAAC AAGCCAAGGC GTGGGTTGAT
GTAGCTAAAT GCCTTCGTGA GCGGGGCTTT ATTGCTGATC GTCAAGAACA CTATGCAGAT
GCCGTAGCCG ACCTTCAACA GGCGATGACC TGGTTTAATC GTTTGGGCAT GCCACTATAT
GCGGCCTTAT GCCAGCGAGC ACTTGGCTTG GCACTAAGCA GAATTGGTCA CTACGATCAG
GGTTTGCGGT TTAACCTTGC AGCGCGTAGC AGCTTTCGTA TGCTTGATCG GCCCGATTTG
GCGGCTGGAT GCGACCAAAA TATTGGCGTG ATTGCCCATT ATGTGCGGCT GCCGCTCATT
GCTCAATCGG CCTATCAGCG AGCGTTGGCG GTCTATCAAG CGCGTTCGAG TACCTATGAT
AGTTGTGTGC TTCAGCGCAA TTTTGCCCTG TTACAGATTA ATCAAGGCAA TGGTCAGCTT
GCGCTCGACT TGCTCGGCGC GATTCAACCT TTAGTGCTAG CCTTAGATGA TCAGTTAGAG
CTTGGCGAGT TTTACGAGGC ACTTGCACAG GCGTGGCACT GCCTTGGTGC TGTTGACCAA
GCTCAGCGCT TTTTCGATCA GGCAATTGTT TGTTTTGAGG CGATTGGCAA TCAGATTAAT
ATTGCCAAAT GTCAGCTTGG TCAAGCTTGG TTGATGCTTG AAACAGGCAA TTGGCAGCTA
GCACAGGCTT TGTTAGCGCA AGCTCAGGTG TGGTTGCTTG AACATCCGAC CCATCGCTGG
CGTTGTCATT ATGGCTTAGG GTATTGTGCT GCACAGGCTG GGGCTAGCGG GCGGGCCATG
GCAGAATATA TCGCTGCTTG TGGCATTGTT GCCCAACTTC GTCAAGCTTT GAGTAGCGAA
CATGCTTCGA GTGCGATTTT TGCCCAAGCG CAGCAACTGT ATCACGATAC CATTCGATTG
GCGCTTGCCC AGGCCAATAG TAACTTGGCA TGGCAATTAA TCGAGCAACA ACGAGCATTG
GTGTTAAATC GCCAAATGCG TTGTTTACCA CTAGCGTTTG ACCCAGACTT GGCAGAGGAA
GATCAGCGCT ATCATGCCCG TTTGAGTAGC TTAACGCAGC CAACTGCTGG TCATGAGCAC
GTAGAGGCAC TATTCGCCGA TTATATCAAT TTCTTAATTC AAGCGCGACA TACCCTTGAA
GGTTCGCTTG CCGATTTGGC GATTGATCGG CCACTTGAGG TTGTTTGTGG CGAACTTGAT
GAAGCCTTTG ATGGGGATTG GACATGGCTC GGCTACAGTC AATTAGGCGA CGATCTGCTG
ATTATCACGC TCTATGCTGG GCAGATTACG GTTATTCGCC AGCCAATTGA TCGGCGTTTC
TGTGAGTTAT TAGGCTTGGC CAACTCGTTT ACTGACCATT CAATTTTGTA TGAGGATTGG
TCGTTTCTGA GCCAAGCTGC TCCTTTCGCC GAGCTTCGGG CCTTATCTGA TCGACTTATC
CCTAGCGTCG TTAAGCAACG CTTGCATCAA AACCATCGCC TATTAATCAC ACCATGTACC
AAGCTGCATC AGGTTGCTTG GGCAGCTTTG CTGGTCAACC AACAGCGACT TTGCCAAACC
TGTATTCCAC AGATTATTCC ATCGTTGGGC ACATGGTCAT GGTTGCAGGC ACGTCAAAGC
TTGGGCACTG AGGCGTTATT GCTGGGCTGT GACAATTTTG GCGAGCGAGC GGACGCGTTG
CCACACATTC AAGCAGAACT GCGGGTTGTT GCGCAACAAG TAACCATCCC CGTCAGCACG
CTGTTTGGAG CCGAAGCAAC AGGCGTGGCC GTGTTGAAGC TAGGTCAAGC TGGGTTATTA
CAACGCTTTC GGCATATTCA TATTGCAACC CATGCGCAAT TAATCGCTGC CCGTGGGTTA
CTTGCCCATA TTAAGCTTGT GGATGGCGAT ATGTTTTACA ACGATATCCT CAATTTACGG
CTTGCTGGGG CGACGGTGGT GTTATCGACA TGTGATGGCT CGCTGAGTGA AACGTTGCTT
GGCGAAGAAG TGCTAAGTTT GAGCCGCGCT TTTTTGGCTG GTGGCGCACG TGAGGTGCTA
GCCAATGGGT GGAAAACTAG CGATAGCGGG GTGGTTGAGT TGATGCGGTT ATTTTATCAC
TATTTAGCCT ACCCAAATGA TGGGGCAACG GCTTTGGCAA TGGCCCAACG CACATTACTT
GAATCTGACG ATCCCAGCCA AGCTGCAGTC TTGGTCTGGG GTGGCTTTCA GGTTGTTGGG
GCTGGAACAC TGGCGCAATG GCCATCTGCG CAGATTCCGT CGATCAGCGT CGGTGATTAA
 
Protein sequence
MLATPLPMFA RIIRLVDQRP LSALVIAHRR LLAMESSACV DHGWAWFCYG WAALHGEKVS 
EGLAALQQAQ ALFAQHHDQA GMWDCRQALL VGRWLQGEGV SLQQAWQPVI EAHLRLGGAR
AAAEAQIYQL IHLNYLKRYQ AVLDLAATIA PHLTTAPQFV AGRFMRIVAI ANAGLGAFTK
AKQGLDQALY ASQQAKAWVD VAKCLRERGF IADRQEHYAD AVADLQQAMT WFNRLGMPLY
AALCQRALGL ALSRIGHYDQ GLRFNLAARS SFRMLDRPDL AAGCDQNIGV IAHYVRLPLI
AQSAYQRALA VYQARSSTYD SCVLQRNFAL LQINQGNGQL ALDLLGAIQP LVLALDDQLE
LGEFYEALAQ AWHCLGAVDQ AQRFFDQAIV CFEAIGNQIN IAKCQLGQAW LMLETGNWQL
AQALLAQAQV WLLEHPTHRW RCHYGLGYCA AQAGASGRAM AEYIAACGIV AQLRQALSSE
HASSAIFAQA QQLYHDTIRL ALAQANSNLA WQLIEQQRAL VLNRQMRCLP LAFDPDLAEE
DQRYHARLSS LTQPTAGHEH VEALFADYIN FLIQARHTLE GSLADLAIDR PLEVVCGELD
EAFDGDWTWL GYSQLGDDLL IITLYAGQIT VIRQPIDRRF CELLGLANSF TDHSILYEDW
SFLSQAAPFA ELRALSDRLI PSVVKQRLHQ NHRLLITPCT KLHQVAWAAL LVNQQRLCQT
CIPQIIPSLG TWSWLQARQS LGTEALLLGC DNFGERADAL PHIQAELRVV AQQVTIPVST
LFGAEATGVA VLKLGQAGLL QRFRHIHIAT HAQLIAARGL LAHIKLVDGD MFYNDILNLR
LAGATVVLST CDGSLSETLL GEEVLSLSRA FLAGGAREVL ANGWKTSDSG VVELMRLFYH
YLAYPNDGAT ALAMAQRTLL ESDDPSQAAV LVWGGFQVVG AGTLAQWPSA QIPSISVGD
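
The protein sequence above corresponds to the gene sequence read codon by codon. As a rough sketch (not the database's own pipeline), the Python below builds a codon table and translates the first line of the gene sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons; since this gene begins with ATG, no special start-codon handling is included. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table in the conventional T, C, A, G ordering.
# ASSUMPTION: table 11 codon-to-amino-acid assignments equal the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored,
    and any trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from this record.
first_line = "ATGTTGGCGA CCCCGCTACC AATGTTTGCT CGAATCATCC GTTTGGTCGA TCAGCGGCCA"
print(translate(first_line))  # MLATPLPMFARIIRLVDQRP
```

The printed residues match the first two blocks of the protein sequence (MLATPLPMFA RIIRLVDQRP). Passing the full gene sequence to `translate` should stop at the terminal TAA codon.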