Gene Cag_2010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_2010 
Symbol 
ID3747120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2549248 
End bp2550537 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content47% 
IMG OID637774547 
Producttransporter, putative 
Protein accessionYP_380301 
Protein GI78189963 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCAAC CACCCGACAC TCCACCCTTC ATGGATACTG AAAGCAGCTC GCCGCAAGGC 
AAGAGCGCCA TAACTCGCTC TCTCCTTAAA GCCTTTCCCG CCTTTGCCAA CCCCGATTTT
CGCCGCTACT TTCCCGGTCA AGTTATTTCA ATGATTGGTA CATGGATGCA AATGGTGGCG
CAAGGGTGGT TGGTCTATGA ATTAACGGGC TCCGCTTTTG ATGTAGGAAT GGCTGCCGCC
GCAACTACCT TTCCCACACT TTTTCTCTCC CTTTTTGGAG GCTTGCTCGT TGATCGCTAT
CCACGCCGAA CCATCCTTTT TTGGACACAA TCGTCAGCCA TGTTGCTTGC CTTTATTTTG
GGCATAGTTA CCATGACGGG CACGGTAACT ATGGGCATTA TTTTATTGCT CTCCTTTTTG
TTGGGGTGCG TTAATGCCAT TAACGTGCCT GCACTCCAAG CCTTTTTGAG TGAAATTGTG
CGGCGCGATC ATCTCCCTTC GGCAATTGCC ATGAACTCCG CTATCTACAA TAGCTCACGA
GTGATTGGAC CAGCACTTGC AGGGTGGCTT ATTGCTTACA GTGGTGCAGG TATTGCCTTC
ATTGTTAATG GGTTTAGCTT TTTTGCAGTG CTCCTCTCCC TTTTTACCAT GAAAACTAAG
CGCCGTGCCC CAACGGTTAT TGAGAGCAAT CCACTGCTTG CAATTCGTGA AGGCGTGCTT
TACGCTTGGA ACCACAAACT CATTCGCCTC TGCATTTATT ACATCGCTAT TGTGTCGGTT
TTTTCATGGG CGTATGTAAG TATGTTGCCC GTTATTGCAA AACAGCGTTT TGGGATGGAT
GCCTCTGGCA TGGGCTCGCT TTTTGGAATT TCGGGTATTG GCTCGGTGAT GGGTACTATT
ATGGTTTCTA TGTTAGCCAA TAAAATTCAG CCACTCCGTT TTATTGCAAT AGGCTCCCTT
ATTTTTGCGG TGGCGCTACT TGGTTTTACG CTAACGGAGA ATTTGCCATT AGCAATGGTT
GGACTTTTTT TTGCAGGCTT CGGATTGGTG GCAGCCGTCT CAACCTTAAG CGCCACCATT
CAAGGTGCGG TTGAGGATCG TTTTCGAGGT AGGGTAATGA GTTTGTATAT GATGATTTTT
ATGGGTTTTA TGCCACTCGG AAATGTTACC ATTGGTTACC TTTCCGATTT GTTTGGAACA
GGTTTTGCAA TTCAGCTCAA TTGCATAGTA ACCATTATAG CCGCACTACT TTTGCTTGTG
CACAGCAAGC AGTTCCTCCG TATTGGGTGA
 
Protein sequence
MTQPPDTPPF MDTESSSPQG KSAITRSLLK AFPAFANPDF RRYFPGQVIS MIGTWMQMVA 
QGWLVYELTG SAFDVGMAAA ATTFPTLFLS LFGGLLVDRY PRRTILFWTQ SSAMLLAFIL
GIVTMTGTVT MGIILLLSFL LGCVNAINVP ALQAFLSEIV RRDHLPSAIA MNSAIYNSSR
VIGPALAGWL IAYSGAGIAF IVNGFSFFAV LLSLFTMKTK RRAPTVIESN PLLAIREGVL
YAWNHKLIRL CIYYIAIVSV FSWAYVSMLP VIAKQRFGMD ASGMGSLFGI SGIGSVMGTI
MVSMLANKIQ PLRFIAIGSL IFAVALLGFT LTENLPLAMV GLFFAGFGLV AAVSTLSATI
QGAVEDRFRG RVMSLYMMIF MGFMPLGNVT IGYLSDLFGT GFAIQLNCIV TIIAALLLLV
HSKQFLRIG