Gene Cag_0400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0400 
Symbol 
ID3747778 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp464414 
End bp465523 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content49% 
IMG OID637772928 
Productchlorophyllide reductase iron protein subunit X 
Protein accessionYP_378716 
Protein GI78188378 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1348] Nitrogenase subunit NifH (ATPase) 
TIGRFAM ID[TIGR02016] chlorophyllide reductase iron protein subunit X 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0534457 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCAA GAACCATAGC GATTTACGGC AAAGGTGGAA TAGGTAAAAG CTTTACCACC 
ACCAATCTTA GCGCCACCTT TGCCAGGATG AATAAGCGCG TGCTTCAGCT TGGTTGCGAT
CCCAAACACG ACTCCACCAC CTCGCTGTTT GGCGGCATTT CGCTGCCAAC CGTAACCGAT
GTGTTTGCGG CAAAAAATGC TAAAAACGAG CAAGTAGCCA TTAGCGACAT TGTTTTTCGC
CGCGATATTG AAGGCTTTCC TCAACCAATT TATGGCATTG AACTTGGCGG TCCACAAGTT
GGGCGCGGTT GCGGTGGACG CGGCATTATT TCAGGTTTTG ATGTGCTTGA AAAGCTCGGC
ATGTTCCAAT GGGATATTGA TATTATTCTT ATGGATTTTC TGGGCGATGT AGTGTGTGGA
GGTTTTGCAA CGCCGCTTGC CCGCTCACTT AGCGAAGAGG TAATTCTTGT AACCAGCAAC
GATCGTCAAG CTATTTTTAC AGCGAACAAC ATCTGCCAAG CAAATAACTA CTTCCGCACC
ATTGGCGGTG AATCGCACCT GCTTGGTATG ATTATCAATC GTGATGATGG TAGCGGTGTT
GCTGAAAACT ACGCACAAGC CGCAGGCATT AACGTGCTGA TGAAAGTGCC CTACAACATG
GAGGCACGCG ACCGCGATGA CAGCTTCGAC TTTGCTATAA AACTCCCCGA GCTTCGCGAC
AAATTCCAAA AGCTTGCAAC CGATATTCTT GAAAAGCGCA TTGCCCCCAG CAACGCCACA
GGGCTTGATT TCAACGACTT TGTGCGCCTT TTTGGCGACG TGAAAAACGA AGCGCCTCGT
CCCGCTAAAG CCGATGAGCT TTTTGCATCA CAACCCGCAG GCAACAACGC ATCCACCACC
ACTCATTCTA CCCAAGAGAG CGACCAGCAA AAAATGGAGC GCTGCATCGC TTGTCTTGAA
CCCATCCAGC AACAACTTTA CCGCCTCGCT GAGCTTGAGA AAAAAAGCCT CACCGACATT
GCATCCCTTA CCAATCTTGA CGAAACCACC ATCAGCGAAA CGCTTACACG CGCCCGCAAA
CAGCTCAAAC GCATGTTTTT TGAGGGATAA
 
Protein sequence
MKARTIAIYG KGGIGKSFTT TNLSATFARM NKRVLQLGCD PKHDSTTSLF GGISLPTVTD 
VFAAKNAKNE QVAISDIVFR RDIEGFPQPI YGIELGGPQV GRGCGGRGII SGFDVLEKLG
MFQWDIDIIL MDFLGDVVCG GFATPLARSL SEEVILVTSN DRQAIFTANN ICQANNYFRT
IGGESHLLGM IINRDDGSGV AENYAQAAGI NVLMKVPYNM EARDRDDSFD FAIKLPELRD
KFQKLATDIL EKRIAPSNAT GLDFNDFVRL FGDVKNEAPR PAKADELFAS QPAGNNASTT
THSTQESDQQ KMERCIACLE PIQQQLYRLA ELEKKSLTDI ASLTNLDETT ISETLTRARK
QLKRMFFEG