Gene Cag_1921 details


Gene Information

Locus tag: Cag_1921
Symbol:
ID: 3747296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2453687
End bp: 2455000
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 48%
IMG OID: 637774456
Product: twin-arginine translocation pathway signal
Protein accession: YP_380212
Protein GI: 78189874
COG category: [R] General function prediction only
COG ID: [COG0446] Uncharacterized NAD(FAD)-dependent dehydrogenases
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
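
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2453687
end_bp = 2455000
reported_gene_length = 1314      # bp
reported_protein_length = 437    # aa

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1314 437
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)  # prints: True True
```

Under these assumptions the reported gene length and protein length agree with the start and end coordinates.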


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000266613
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTTTTC AGTGCGTTAT GACAAATCAT CAGCTTTCAC GGCGCGATTT TGCAAAGTTG 
TTACTGTCGA GCACGGCAGG CGCTTTGCTT GGGGTTGGTG TGCCATCGTC GCGTACGTAT
GCGGCAACCA ATCGTGTAGT TATTATTGGT GGTGGGTTTG GTGGAGCTAC GGCAGCAAAG
TATCTCCGCA AGCTTGATCC GTCGGTTGCT ATTACGTTGG TTGAGCCAAA ATGCCAATTT
TATACCTGCC CTATTAGCAA TTGGGTGATT GCTGGTTTAA AGCCAATGCA CGCTATTGCG
CAAAATTATA ATGCGTTAAG GGTGCGTTAT GGTGTAAATG TGGTACACGC TACGGCGGTG
GCTATTGATG CCTTAAAAAA CAGCGTTACC CTGCATAGTG GCAAAAAGCT CTTTTATGAT
CGTTTAATTG TGTCGCCCGG TATTGATTTC CGTTGGAATG CAATTCCTGG TTATAGCCAA
AAGGTGGCTG AAAGTGTTAT GCCGCATGGT TTTCAAGCGG GTGAGCAAAC GTTGCTGTTG
CGCAAGCAAT TGCTGGCAAT GCCGAATGGT GGCACGGTAA TTATGTGCCC TCCCAACAAT
CCGCACCGTT GCCCTGCTGC ACCTTATGAG CGTGCAAGTT TAATAGCACA TTATTTAAAG
CAGCACAAGC CAAAGTCGAA GGTGCTCATT CTTGATTGCA AGGAGAAGTT TTCCAAACAA
GAGCTTTTTT TGCAGGGTTG GGAGCGTTTG TATTCGGGAA TGATTGAATG GCGTGCGGCA
ACGGCTGGCG GTAAAGTGGA GGCGGTGAAT AGTGCGGCTA TGACGGTTAC CACCGAGTTT
GGTGATGAAA AGGGTGACCT TATTAATATT ATGCCACCGC AACAAGCGGG TCGCATTGCT
TTTGAGGCTG GTTTAACCGA TGCCGCTGGT TGGTGCCCTG TGCATCCCAT TACCTTTGAA
TCAACGCTGC ATCCCGGCAT TCACATTATT GGCGATGCTT GCCACGCGGG TGATATGCCA
AAATCAGCAT TTGCCTCAAG TAGCCAAGGG AAGGTGGCAA GTTCGGCTAT TGCTGCATTA
CTGCAAGGCA GAGTGCCTGT TGCGCCATCA TTAGTAAGCA CCTGTTACAG CTTACTTAAG
CCCGATTACG CTATTTCGGT AGCTAATGTT TTTCGTTTAA CGATTGACGG TATTGTTGAT
GTAAAAGGTT CGGGTGGCGT TACCCCGCTT GATGCGTCGG TGGAACATTT GCAACACGAA
GCCGATTTTG CATGGGGATG GTACGAAAAC ATTACACGCG ATACGTGGGG CTAA
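
The GC content field of this record can be recomputed from the gene sequence. The sketch below is a minimal illustration, assuming the sequence is pasted as shown here (blocks of 10 bases separated by spaces and line breaks) and contains only A, C, G and T. The function name `gc_percent` is hypothetical. The record presumably reports a rounded percentage, so compare after rounding.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence: 6 of 20 bases are G or C.
print(gc_percent("ATGTTTTTTC AGTGCGTTAT"))  # prints: 30.0
```

Passing the full 1314 bp gene sequence to `gc_percent` gives the value to compare with the reported GC content.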
 
Protein sequence
MFFQCVMTNH QLSRRDFAKL LLSSTAGALL GVGVPSSRTY AATNRVVIIG GGFGGATAAK 
YLRKLDPSVA ITLVEPKCQF YTCPISNWVI AGLKPMHAIA QNYNALRVRY GVNVVHATAV
AIDALKNSVT LHSGKKLFYD RLIVSPGIDF RWNAIPGYSQ KVAESVMPHG FQAGEQTLLL
RKQLLAMPNG GTVIMCPPNN PHRCPAAPYE RASLIAHYLK QHKPKSKVLI LDCKEKFSKQ
ELFLQGWERL YSGMIEWRAA TAGGKVEAVN SAAMTVTTEF GDEKGDLINI MPPQQAGRIA
FEAGLTDAAG WCPVHPITFE STLHPGIHII GDACHAGDMP KSAFASSSQG KVASSAIAAL
LQGRVPVAPS LVSTCYSLLK PDYAISVANV FRLTIDGIVD VKGSGGVTPL DASVEHLQHE
ADFAWGWYEN ITRDTWG
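
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is an illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons. Since this gene starts with ATG, that difference does not matter here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in NCBI order: first, second and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace is ignored. Assumes only A, C, G and T are present;
    any other symbol raises KeyError.
    """
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGTTTTTTC AGTGCGTTAT GACAAATCAT"))  # prints: MFFQCVMTNH

# Last 12 bases of the gene sequence, ending in the TAA stop codon.
print(translate("ACGTGGGG CTAA"))  # prints: TWG
```

Both fragments match the corresponding ends of the protein sequence listed above.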