Gene Haur_3689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3689 
Symbol 
ID5735538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4641038 
End bp4642402 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content53% 
IMG OID641280841 
Productcyclic nucleotide-binding protein 
Protein accessionYP_001546453 
Protein GI159900206 
COG category[C] Energy production and conversion 
COG ID[COG1142] Fe-S-cluster-containing hydrogenase components 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.120597 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCCTC TCGCGCTCAA TCTCCAAGTT GATGCTTTGC AGAGCCTGCC CAATTTGGCC 
GAATTACCCA CCAACGAAGC CACAATTTTG GCGCGAATTG GGGTTTTTCG GGCTTTTGCG
GCAGGCGACA CGATTCCGAT TGCCCGTTTA CGCAGTAGCC AATGCTATGT GATTTTGAGC
GGCGTGGCTG ATACAGTGAT TCTTGATCGC GATGGTGAGC CGATTTCAAT TGGCGAGTTG
GGCGAGGGCG ATTTTTTTGG CAATAGCATC TTTTTTAGCT CACATTCGCT GTTGTATGCC
GTCCAAGCCC AAACCCAAAT CTTCGCGTTG CAATGGTCGA TTGAGCGTTT GCACGAAAAA
AAGCAACATC TGCCACTATT TATGCGTTTG CTCGAAGCGA GCTATTTGCA ACGCCGCGCG
GTTAGTGCCC TGAGCCGTGT GCCGTTGTTT AGTCATGTCA GCGTTGAAGA ACGGGCCTTG
TTGGCAACCC AACTCACGCG CCAAGAATTT GGCCGTAACA CCGTGATTTT TGAGCAAGGC
TCGGCTGGCC AAGCCTTATA TTTGATCGAA CAAGGCCAAA TTGCCGTTGA GCAACATGGC
GTGATTGTAG CAACCCTCAG CGATGGTGAT TTTTTTGGCG AGATGGCTTT GCTCTCGGCA
ACACCGCACA ATGCCACCTT ACGTTGTCTA ACCCCAACCC GCTGCTTGCA CCTGCCGGGT
GCGGTTTTTG CGGCCCAAGT TGCCCAACAT CCTTCGCTTG AAGCGGCGGT ACGGCGGGTG
ATCGATGAAC GGGTGCATCA CTCGGAGCGA GTACGCGGCG ACCAAACTCG TCAGCATTTG
ATCAAAGTGG CGGTGCGCTA TGGCATGTTT CGTGGCTCGC ATGTGTTGGT GCGCCAGCCC
GCGCAATGCC CGCCCGATTG CCGAATTTGT GAGCAGGCTT GTGCTGAGCG TTTTGGCCAA
ACCCGTATGC GGCTCAACGG CGCTAAAATC GAAGATTGGG ATATTACTCA GAGTTGTCGG
CAGTGTCGAG TTGGAGCCGA GTGCGTTGAG GCATGTCCCG AAGCTGCGAT TCAATGGGAT
GATAATGGGG CGTTACGGAT TACTGATGCT TGCACTGGTT GCAACGAGTG TGTGCTGGCC
TGCCCTTATG ATGCGGTTGA ATCGCAAACG ATCTTTTTAC AGAACCAGCA AGGGCCACTT
TGGCAGCTTT GGCAGCGGAT GCGCCAGCAA TCACATCAAA TTCAGCCCAA AACCGTGGCT
AGTAAATGCG ATTTATGCGC AGGCTATGAT GATCGGGCTT GTTTGAGCCA ATGCCCAACT
GGCTCGTTGC AATTAATCTC AATCGAAGAG CTATTTCCCT TTTGA
 
Protein sequence
MAPLALNLQV DALQSLPNLA ELPTNEATIL ARIGVFRAFA AGDTIPIARL RSSQCYVILS 
GVADTVILDR DGEPISIGEL GEGDFFGNSI FFSSHSLLYA VQAQTQIFAL QWSIERLHEK
KQHLPLFMRL LEASYLQRRA VSALSRVPLF SHVSVEERAL LATQLTRQEF GRNTVIFEQG
SAGQALYLIE QGQIAVEQHG VIVATLSDGD FFGEMALLSA TPHNATLRCL TPTRCLHLPG
AVFAAQVAQH PSLEAAVRRV IDERVHHSER VRGDQTRQHL IKVAVRYGMF RGSHVLVRQP
AQCPPDCRIC EQACAERFGQ TRMRLNGAKI EDWDITQSCR QCRVGAECVE ACPEAAIQWD
DNGALRITDA CTGCNECVLA CPYDAVESQT IFLQNQQGPL WQLWQRMRQQ SHQIQPKTVA
SKCDLCAGYD DRACLSQCPT GSLQLISIEE LFPF