Gene Cagg_3131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3131 
Symbol 
ID7269880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3790400 
End bp3791758 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content56% 
IMG OID643567952 
ProductPUCC protein 
Protein accessionYP_002464425 
Protein GI219849992 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.400987 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCTGC TCCGTTTCGC CGTCAAGACC TTTCGCCTCT CGCTTATGCG GGTAGGCGCC 
GGATGGATGT TCGCCCTTCT TACTTTCAAC TTCAATCGCG TGACCATTGC TGACCTCGGC
GCAATGGCGG TAATCGTCAC CACATTGATC GGACTACACC ACTTCATCTC GTTCTTCCAA
GTGTATTGGG GCCGCTTCAC CGACCGCTAT CCTATCTTCG GTCTCCGGCG CACCCCATAC
GTCATCTTGT CAAATATCGG CGCAGCGTTG ATCTTCATGG CGCTTCCCAG TATTGCCATC
GGTCTTGGTG AGCGTTCACT ACTCGCAACA ATTGAGGCAT TTGCCTTGAT CTTCCTGTTT
GGTGTGCTGA TGGCAATGAA CGGCAGCTCG TCGAACGCTC TTATCGCCGA AGTGACGACC
CCTAAGGAGC GCGGAGCAGT CGTCGCATTT ATCTGGGCGA CGGTCATTAT CAGTGGGATT
GTGTCGGCCG GTGTGTCACG GGCGATTATG CCCCAATACT CACCTGAATC CATGCAATTC
CTCTACAACC TTACCCCGAT TATCGTCACG GTGACCGTCC TACTGGGTGT ACTTGGCCTC
GAGAAGCCGA TCTCGAAAGA GGAACACCGC AAATTGCTCA TAGCAGCTCC AGAAAAGGGT
GAGGCCGGCC CAATCGAGAC GTGGCGGGTA GCGACGAGTC TGATGGGGCG CAACCCACAA
GTGCGCGGAT TTTTCCTCTT TGTACTGCTT GCGATTTTCG GTATCTTCCT TCAGGATGCC
ATTCTTGAGC CATTTGGTGC GGAAGTCTTC AATATGCCAC AAAAAGACAC CGCTGCCTTC
CAACAGATGT GGGGCGCCGG CGCGTTGCTC GGCATGCTCG TGATTGGCAT TTTGTCGAGC
ATCTTCCCCA TTTCCAAGAA GACGATCGCG ACGGTTGGCG GCTTGGGTGT CGCCGGCGGG
TTGGCGATGT TGTCCATCTC GGCCCTGACC CACCAGCAGG GCGTGATTAT GCCGGCACTG
ATGATTATGG GGCTTGGCAT CGGCCTGTTC GACGTCGGCG CACTCGCTAT GATGATGGAA
ATGACGGTTG AAGGTCAGAC CGGCCTGTAT ATGGGGATGT GGGGCATGGC GCAGGGTCTT
GGCAACGGCT TCGCGAACGT AATTAGCGGC TTGGGCCATA CGGTGATGAT CGAGGGCGGC
ATCGTATCGC CGGCGATTGG TTACGGGCTT GTCTTTGGCC TCGAAGCCTT ACTGATGGTG
ACAGCCATCG GCATCTTGCG TGGCATCTCG GTGCAGGAGT TCAAGGGTCT TACCCGACAA
GATATTACGA CGGCATTGGC AATGGATACG GCCTCTTAA
 
Protein sequence
MGLLRFAVKT FRLSLMRVGA GWMFALLTFN FNRVTIADLG AMAVIVTTLI GLHHFISFFQ 
VYWGRFTDRY PIFGLRRTPY VILSNIGAAL IFMALPSIAI GLGERSLLAT IEAFALIFLF
GVLMAMNGSS SNALIAEVTT PKERGAVVAF IWATVIISGI VSAGVSRAIM PQYSPESMQF
LYNLTPIIVT VTVLLGVLGL EKPISKEEHR KLLIAAPEKG EAGPIETWRV ATSLMGRNPQ
VRGFFLFVLL AIFGIFLQDA ILEPFGAEVF NMPQKDTAAF QQMWGAGALL GMLVIGILSS
IFPISKKTIA TVGGLGVAGG LAMLSISALT HQQGVIMPAL MIMGLGIGLF DVGALAMMME
MTVEGQTGLY MGMWGMAQGL GNGFANVISG LGHTVMIEGG IVSPAIGYGL VFGLEALLMV
TAIGILRGIS VQEFKGLTRQ DITTALAMDT AS