Gene Cagg_2155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2155 
Symbol 
ID7267663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2644826 
End bp2646109 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content55% 
IMG OID643566987 
Productglucose/sorbosone dehydrogenase-like protein 
Protein accessionYP_002463475 
Protein GI219849042 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00567521 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCTACA CAATTTGGTT CATCGTTCTT GGTACTATTG CTCTAACGGC TTGTACCGTC 
GCCTCCAATA ACCCAACAGC ACAACCCACC TTGCCGCCCA CGACACCGGT CGCCACACAA
ACCGTTCCTA TCCCAACCGT GCCACCAACC TCGGTACCTA TACAGCCATC GGTAGATCCA
CCCACATCCA CACCGACAGC AGCCATCGAT CCGATCACCC TCACCTATTC GCTCGAGAAG
ATTGCCGATA ACTTTCGTCG TCCAACCCAC CTCACGAACG CTGGAGATGG TAGTCGGCGC
TTATTTGTGG TCGAACAGGA AGGACAGATT TGGGTTATAT ACGATGGACA ACGGCTCAGT
GAACCGTTTC TCGATCTGCG CGCGCAAGTG GGATCGCGCG GAAACGAGCA GGGCTTACTG
AGCATCGCCT TCCATCCCCA ATTTGCCAAC AATGGTCGCT TTTTTGTCAA TTATACCGAC
CGAAATGGTG ATACGGTCGT TGCTGAATAC CGAGTCAGTA CCGATCCCAA TCGGGCCGAT
CCGGCGAGTG GCCGCGAATT GCTACGGATC GACCAGCCGG CAGCCAATCA CAACGGCGGT
TTACTCTTGT TTGGCCCAGA TGGTTATCTC TATATCGGTA CAGGTGATGG TGGCGGCGCC
GGCGACCCAC TCGACGCCGG GCAACGGCTC GATACGCTGT TAGGTAAACT CTTACGGATC
GATGTTGATA AGGGCCAGCC GTATGCTATT CCTGCCGATA ACCCCTTCCT CAACCGCAAC
GGTGCTTTAC CGGAGATTTG GGCTTACGGC TTACGTAACC CATGGCGTTT TACCTTCGAC
GCGGTTGATA ACATTCTCTT TATCGCCGAT GTTGGTCAAA ATGCGTGGGA AGAGGTCAAT
GCTGTACCGG CGAATGCTGC CGGCCTCAAT TATGGTTGGC GATTGATGGA AGGTGAACAA
TGCTACCGAC CGGCGACGTG TGATCCGAGT GGGTTGGTCA TGCCGGTCAC CGTCTATCCA
CACGACAGCG CCATCGGTGG TTGTTCGGTA ACCGGCGGTG AAGTGTATCG CGGTATACGC
CAACCGGCAC TCACCGGCGT CTACTTCTAT GCCGACTTTT GTACCGGTAA TCTGTGGGCT
TTGTGGAGGA ATACGGGCGA ATGGCGACAC GCACTGGTGG CACGCCTGAA CCTTCAGACG
ACTTCGTTTG GGTTAGATGA GGATGGCGAA ATCTACCTGC TCGACCGTGC CGGCAGTGTG
TACCGACTCG TAGCGGGTGA GTGA
 
Protein sequence
MRYTIWFIVL GTIALTACTV ASNNPTAQPT LPPTTPVATQ TVPIPTVPPT SVPIQPSVDP 
PTSTPTAAID PITLTYSLEK IADNFRRPTH LTNAGDGSRR LFVVEQEGQI WVIYDGQRLS
EPFLDLRAQV GSRGNEQGLL SIAFHPQFAN NGRFFVNYTD RNGDTVVAEY RVSTDPNRAD
PASGRELLRI DQPAANHNGG LLLFGPDGYL YIGTGDGGGA GDPLDAGQRL DTLLGKLLRI
DVDKGQPYAI PADNPFLNRN GALPEIWAYG LRNPWRFTFD AVDNILFIAD VGQNAWEEVN
AVPANAAGLN YGWRLMEGEQ CYRPATCDPS GLVMPVTVYP HDSAIGGCSV TGGEVYRGIR
QPALTGVYFY ADFCTGNLWA LWRNTGEWRH ALVARLNLQT TSFGLDEDGE IYLLDRAGSV
YRLVAGE