Gene Cagg_3553 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3553 
Symbol 
ID7266481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4316578 
End bp4317498 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content56% 
IMG OID643568360 
Productperiplasmic solute binding protein 
Protein accessionYP_002464827 
Protein GI219850394 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00157548 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGTAC AGTATCTCTG CATTATCGTC TTGCTTCTCA CCCTCACCGG TTGTAGTACA 
CCGGTAACAG ACGGTCGTCT GCGCGTCGTC GCAACTACCG GCCCGGTCGG TGATATTGTG
CAAGTTATCG CCGGTGAACG GGTAGTCTTA CGCACACTCA TCGGCCCCGG CATCGATCCG
CATACTTACG TTGCCACTGA GAGTGATCTA TTTGCCCTCC AAGAAGCCCA AATCGTGTTT
TACAACGGTT TGCACCTTGA AGCTGGGTTA GACCGCATAT TCAAGGCGAT GAACCAAAGT
GGACGCATTC CGGCAATTGC CGTGGCTGAA GCAATTCCTC CCCACCTCCT GCTGTATGCC
GACGAGGGGA GAAACGCTTA CGATCCACAC GTCTGGCACG ATCCGCAACG TTGGAGCTAC
GCTGTTCGGG CGGTGCGTGA TACCTTGATT GCCGTTGATC CGGGTGGTAG GGCCATCTAC
CACCGGCGAA CTGAGCGCTA TTTGGCCGAT TTACAGTCAC TCGACGCTGA GTTACGGGCA
ATGGCAGCGC GGATTCCACC TGAGCGACGT ATTCTGGTCA CGGCGCACGA TGCGTTTCAA
TATTTTGGGC AAGCTTACGG GTTCCGCGTT GAAGCCGTTC AAGGGATCAG CACGGCTAGT
GAAGCAAGTG CCACAGCGAT CAGGTCATTG ACCGAGCTGG TTGTGACGAA TCGTATTCCG
GCGATATTTG TCGAGACGAG CGTTTCACCG CGCACAATTG AGGCAGTGCA GAGCGCAGCG
CGTGCAGTTG GGTATGAAGT GCGTCTCGGC GGTGCGCTGT TCTCCGATTC ACTCGGTGAC
CCTGATGGGC CGGCGGGGAC GTATGTTGGA ATGATGCGCC AGAATATGCA AACGATCGTC
TCGGCGCTGA CCGATATGTA G
 
Protein sequence
MPVQYLCIIV LLLTLTGCST PVTDGRLRVV ATTGPVGDIV QVIAGERVVL RTLIGPGIDP 
HTYVATESDL FALQEAQIVF YNGLHLEAGL DRIFKAMNQS GRIPAIAVAE AIPPHLLLYA
DEGRNAYDPH VWHDPQRWSY AVRAVRDTLI AVDPGGRAIY HRRTERYLAD LQSLDAELRA
MAARIPPERR ILVTAHDAFQ YFGQAYGFRV EAVQGISTAS EASATAIRSL TELVVTNRIP
AIFVETSVSP RTIEAVQSAA RAVGYEVRLG GALFSDSLGD PDGPAGTYVG MMRQNMQTIV
SALTDM