Gene Cag_0334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0334 
Symbol 
ID3748054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp373499 
End bp375469 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content46% 
IMG OID637772861 
ProductATPase 
Protein accessionYP_378650 
Protein GI78188312 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGAAG CGCGCAATCT CTCCCTTAGC ATTGGAACAA AACAACTGTT AAACGACACC 
TCCTTCCGCA TTGGCGACAC TGACCGTGTT GCGCTGGTGG GCTTAAACGG CACAGGCAAA
TCCACGCTGA TGCGCCTTAT TAGCAACACC TCTCCCGATA GCAGCACGCT ACGAGTGGGA
GGCGACTTCA TAAAATCAGC CGATACCACC ATTGGTTACT TGCCACAAGA GATTTCGTTT
GAAGATGATC TTGAAAAAAG TGCGCTTCAC TACGCCCTGC AAGCCAATAA AGAACTTTTT
GACCTTTCCG AAACCATTAC GCGCTTTGAG CACGAACTTG CTCTCCCCGA ACACGATTAC
GAAAGCGAAG CATATCACCG CTTAATTGAG CGTTTTTCCG ACGCCATGCA CAACTTTGAG
CGACTTGGTG GCTACACCAT GCAATCGGAT GCCGAAAAAG TGTTAGCAGG CTTAGGGTTT
AGTGAAATTG ATTTTCATAA AAAAGTAAAA GCTTTTTCAG GTGGCTGGCA AATGCGCCTG
CACATTGCAA AGCTACTTTT GCAAAACCCA ACCCTGCTTT TGCTTGACGA GCCAACCAAC
CACCTTGATA TTGACTCGTT ACGCTGGCTT GAAAACTATT TAACGAATTA CGAGCACAGC
TACATCATTA TTTCGCACGA TCGTTTTTTT CTCGATAAGC TCACTACACG CACACTTGAA
ATTGCCTTTG AGCGCATTAA CGAGTACAAA GGCAATTATT CAACCTACGA AAAAGAGAAG
GTTGAACGCT ATGAGCTTTT GATGAGCAAA TATCAAAACG ATTTAAAGAA AATGGCGGAG
CTGAACTCCT TTGTAGAGCG CTTTCGCTAC AAAGCCACCA AAGCACGTCA AGCCCAAAGT
CGCCTCAAGC AAATGGAAAA GCTTGAAAAA AACTTAGTGG CTCCCGAAGA GGATTTATCG
CAAATCTCTT TCCGCTTCCC CAAAGCACAG CCTTCAGGAC GAGAGGTAAT GCGTCTTGAC
GGAGTAAAAA AATCCTACAC ACTCCCTGAC GGCAGTCGTA AAGAGGTACT CAAACGGATT
GATTTGGAAA TTATGCGTGG CGACCGCATT GCCATTGTCG GCTCAAACGG TGCAGGAAAA
AGTACCTTTT GCAAAATTCT CGCCAATGAA TTGGATTACG AAGGCAAACT CACCACTGGG
CACAACGTAT CGCTCAACTA CTTCGCCCAA CACCAAACCG ACACCCTTGC AACCGAAAAA
AGCATCTACA TCGAAATGAT GGATTCGGCT CCAAATTCCG AAGCACAAAA AAAAGTACGC
GACATTCTTG GCTGCTTTTT GTTTAGCGGC GACACCGTCA ACAAAAAAAT TAAAGTGCTT
TCGGGAGGCG AAAAATCGCG CGTTGCACTT GCAAAAATTC TCTTGCAAGC CTCCAACCTG
CTCATTATGG ACGAGCCAAC CAACCATCTT GATATGCGCT CCAAAGAGAT GCTGATTGAG
TCGCTTGAAA ACTACGATGG CACGCTCTTG CTTGTTAGCC ACGACCGCTA CTTTCTTGAT
AGCCTTGTTA ACAAAGTGGT AGAAATTAAA AACGGCACCC TCCAACTCTA CTTAGGAACT
TACGCCGAGT ACCTTGAAAA ATCGGAAAAA ACGCGCCAAG CCGAAGAACA AGCCGAAGCG
CTTCAGCGCC AAAAAGAGCA AGCTGCCGCC AAAGCTGCAA TAAAAGCTGA AGAACAACGC
GCCGCTGCTG CAACGCCAGC GCCCGCAAAA GCTAAAAACA GCAAAAAATT AGAGGCTATT
GAGAAAAAAA TTAATCAGCT TGAGCAGCAA AAAGAGGAGA TGGAAAGAAT AATGGCTACG
GAGGATTTCT ATAAAAAAAG CAAGGAAGAA AATGCGCGCA CGCTTGAGCA TTACCACAAA
CTATGCGATG AACTAAATGC ACTGTTTGCG GAATGGGAGA CGTTGGGGTA A
 
Protein sequence
MFEARNLSLS IGTKQLLNDT SFRIGDTDRV ALVGLNGTGK STLMRLISNT SPDSSTLRVG 
GDFIKSADTT IGYLPQEISF EDDLEKSALH YALQANKELF DLSETITRFE HELALPEHDY
ESEAYHRLIE RFSDAMHNFE RLGGYTMQSD AEKVLAGLGF SEIDFHKKVK AFSGGWQMRL
HIAKLLLQNP TLLLLDEPTN HLDIDSLRWL ENYLTNYEHS YIIISHDRFF LDKLTTRTLE
IAFERINEYK GNYSTYEKEK VERYELLMSK YQNDLKKMAE LNSFVERFRY KATKARQAQS
RLKQMEKLEK NLVAPEEDLS QISFRFPKAQ PSGREVMRLD GVKKSYTLPD GSRKEVLKRI
DLEIMRGDRI AIVGSNGAGK STFCKILANE LDYEGKLTTG HNVSLNYFAQ HQTDTLATEK
SIYIEMMDSA PNSEAQKKVR DILGCFLFSG DTVNKKIKVL SGGEKSRVAL AKILLQASNL
LIMDEPTNHL DMRSKEMLIE SLENYDGTLL LVSHDRYFLD SLVNKVVEIK NGTLQLYLGT
YAEYLEKSEK TRQAEEQAEA LQRQKEQAAA KAAIKAEEQR AAAATPAPAK AKNSKKLEAI
EKKINQLEQQ KEEMERIMAT EDFYKKSKEE NARTLEHYHK LCDELNALFA EWETLG