Gene Cag_1965 details


Gene Information

Locus tag: Cag_1965
Symbol:
ID: 3747827
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2494383
End bp: 2495936
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 48%
IMG OID: 637774501
Product: Fe-S-cluster-containing hydrogenase components 1-like
Protein accession: YP_380256
Protein GI: 78189918
COG category: [C] Energy production and conversion
              [P] Inorganic ion transport and metabolism
COG ID: [COG0437] Fe-S-cluster-containing hydrogenase components 1
        [COG3301] Formate-dependent nitrite reductase, membrane component
TIGRFAM ID:
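
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record itself. The function names are hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information fields.
gene_length = span_length(2494383, 2495936)
print(gene_length)                           # 1554
print(protein_length_from_cds(gene_length))  # 517
```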


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTACG GTTTTGTTAT AGATGCTCGC AAATGTATGG GATGTCATGG TTGCACGGTT 
GCGTGTAAAT CGGAGCATCA GGTGCCGCTT GGTGTGAATC GTACATGGGT GAAGTATGTG
GAAAAGGGGA TGTTCCCTGA AAGTCGTCGT TATTTTACGG TGTTGCGTTG CAACCATTGT
GCTGAGCCGC CTTGCGTGGA TATTTGCCCT GTGGAGGCGT TGCATAAGCG TGAGGATGGG
ATTGTGGATT TTGATAAGCG TCGTTGCATT GGTTGCAAGG CGTGTGCTCA AGCCTGCCCT
TATGGCGCGT TGTACATTGA TCCTGAAACG CACACGTCGG CAAAGTGTAA CTATTGTGCG
CATCGCAAAG AGGTGGGTAT GAAGCCTGCT TGTGTGGTGA TTTGCCCTCA GCAAGCAATT
GTTTCGGGCG ATTTTGATGA TCCCAATAGT GCGGTTTCCA TTTTGCTTGC TAAGAATCAA
ACTGCTGTTC GTAAGCCTGA AAAGGGCACT TCCCCTAAAC TCTTTTACAT TAATGGCGAT
GGTGCTTCGC TTGATCCTTT GGAGGTTTCA GCGGTTAATG GCTACCTCTG GAGTGAGCAG
TTGCGTGGTG TTGGTCATTT TGCGGGTAAG TCGTATGAAA AGCCTGCAAA GGCTACGGTT
CCTCAGTCGG GCAAGCATGG ACGTCCAAAG GCTCGCAAGG TGTATGATGT TCCATCAAAA
GGAGTTGTGT GGGGCTGGGA AGTTCCTGCC TATGTGTGGT CAAAAGCGGT CTCATCCGGT
CTTTTTTTAG TATTGGTGAT GATGCAGCTT TTTGTAGCAT CACTGCTTTC GGAATCTATG
CAGTGGGCAT CATGGGCAGC ATCGCTCTTT TTCCTTGCCT TAACGGGCGG CTTTTTAGTA
AAAGATCTTG ATCGTCCTGA GCGTTTTTCA TTTGTGATGT TGCGTCCTCA ATTTGACTCA
TGGCTTGTTA AGGGCGGTTT AACCATTACG GGTTTTGGGG CATTTCTTGC GTTGTGGGGA
GCGGGTTCAT TTTTACAGCT TCCACTGCTT ACTTCCATTG CACAAGTCGG GGGGACGCTT
TTTGCGGTTA TCACGGCGAT TTACACTGCA TTTCTTTTTG GCTCGGCTAA AGGGCGCGAT
TTTTGGCAAA GTCCTATGCT GCTGCTGCAC ATGTTGCTCA ATTCGCTGTT AGCCGGTGGT
TCTGCTATGC TTTTGCTTGG TGTGGTAACG GCAAGTAGCA ATGACCTTTT CTCCCTTTTA
CAGCCTTCGT TAGCTGCTGG TTTTGCCTTC CACCTTATTA TTATGGCATT AGAGCTGTTT
GGTAAACATC CCTCAGCCTC AGCCGAACGT GCGGCGGAAA CTATTCTGCA TGGCGAGCTA
AAACATCCAT TTTGGATTGG CTCGGTGCTG ATTGGTAACC TTATGCCTTT TGTGCTTTGC
CTTGTAGCCC CATCATCTTT TGGGCTTACA ATAGCCGCAT TAATGGCTCT GTGTGGTGTT
TTTTATACTG AAAAAGTGTG GGTACGTGCT CCACAAACAG TTCCTTTAAG TTAA
 
Protein sequence
MNYGFVIDAR KCMGCHGCTV ACKSEHQVPL GVNRTWVKYV EKGMFPESRR YFTVLRCNHC 
AEPPCVDICP VEALHKREDG IVDFDKRRCI GCKACAQACP YGALYIDPET HTSAKCNYCA
HRKEVGMKPA CVVICPQQAI VSGDFDDPNS AVSILLAKNQ TAVRKPEKGT SPKLFYINGD
GASLDPLEVS AVNGYLWSEQ LRGVGHFAGK SYEKPAKATV PQSGKHGRPK ARKVYDVPSK
GVVWGWEVPA YVWSKAVSSG LFLVLVMMQL FVASLLSESM QWASWAASLF FLALTGGFLV
KDLDRPERFS FVMLRPQFDS WLVKGGLTIT GFGAFLALWG AGSFLQLPLL TSIAQVGGTL
FAVITAIYTA FLFGSAKGRD FWQSPMLLLH MLLNSLLAGG SAMLLLGVVT ASSNDLFSLL
QPSLAAGFAF HLIIMALELF GKHPSASAER AAETILHGEL KHPFWIGSVL IGNLMPFVLC
LVAPSSFGLT IAALMALCGV FYTEKVWVRA PQTVPLS
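
The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the method used by the source database, of how one might strip the block formatting, estimate GC content, and translate the gene. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons, which this sketch does not handle). All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGAATTACG GTTTTGTTAT AGATGCTCGC"
print(translate(first_blocks))  # MNYGFVIDAR
```

The output matches the first block of the protein sequence. Pasting the full gene sequence into `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.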