Gene Cagg_2161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2161 
Symbol 
ID7267669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2654107 
End bp2655240 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content59% 
IMG OID643566992 
Productglycosyl transferase group 1 
Protein accessionYP_002463480 
Protein GI219849047 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00492248 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTGTGG CGCTCATTGC CGAGACGTTT CTACCCGATG TCAATGGTGT AACGACAACG 
CTCTGTCGCC TCCTAGAGCA TTTACAGCGC ACCGGTCACG AAGCTGTGCT ATTCGCACCC
CAAGGTGCGC CGACAAGCTA TGCCGGTGCA GAAATCGTAC CACTCAGCGG AATGCCGTTA
CCGCTCTATC CTGAAGTCAA ACTCACTCCA CCACAACCCG GCCTAACGGC CCGCTTGCGT
AGCTTTCAAC CCGACGTCGT GCATTTAGTC GGACCGGTAG TGTTAGGGGC AATTGTCCCC
GGTATCGTCC GTCGGCTCGG ACTACCCCTG ATCGCCTCGT ACCACACCGA CTTTGGCGCA
TACAGCCGAC ACTACGGTTT CGGTTTCTTA CAACACGGCG TCAATGCATG GCTGCGTTGG
ATTCACAACC GTTGCCGGAT TAACCTTTGT CCTTCGAGTT TTACCCTTCA TGCTCTCCGT
GCCGCCGGTT TTCGCCGCTT GCGGATTTGG GGACGCGGCG TCGATATCGA ACGGTTCCAC
CCGCGCTATC GCAGTGAAGC GTGGCGGGCT GCTATCGGGA TACAACCGGG TGAGCGGTTA
GTGCTCTATG TAGGTCGGGT AGCCGCCGAA AAGCGGGTCG ATCTGTTACC GGAAGCCATC
CGCGGCCTGC CGAACGTCCG CCTCGTAATT GTCGGCGATG GACCCTTCCG CGCCGAGTTG
CAACGGCGTT GCGCTGGTCT GCCGGTGCAT TTTACCGGTT ATCTTAAGGG AGAGGCTTTG
GCGGTAGCTT ATGCAAGCGC CGATGCGTTT GTCTTCCCCT CCGATACCGA CACCTTCGGA
CAAGTTATTC AAGAAGCGAT GGCTTCCGGC TTACCGGTCG TGGCTGCACG GGCCGGTGGT
GCGATCGATC TGGTACGTCA CGGCCACAAC GGGTATCTGT TTACTCCCGG CGTTGTTACC
GATTTGCGCG CCCGCCTCCG AGAACTACTC GCCAACGACA GCCGTCGGAT CACACAGGGG
CTGGCCGGAC GCGCTGCTGC CGAACGACGA TCGTGGCCGA GTGTGATGGA TGAACTCATG
GGGTATTACA CGCGAGCAAT GTCGCATCGC CGTTTGGGAA GACAACCAGG TTAG
 
Protein sequence
MRVALIAETF LPDVNGVTTT LCRLLEHLQR TGHEAVLFAP QGAPTSYAGA EIVPLSGMPL 
PLYPEVKLTP PQPGLTARLR SFQPDVVHLV GPVVLGAIVP GIVRRLGLPL IASYHTDFGA
YSRHYGFGFL QHGVNAWLRW IHNRCRINLC PSSFTLHALR AAGFRRLRIW GRGVDIERFH
PRYRSEAWRA AIGIQPGERL VLYVGRVAAE KRVDLLPEAI RGLPNVRLVI VGDGPFRAEL
QRRCAGLPVH FTGYLKGEAL AVAYASADAF VFPSDTDTFG QVIQEAMASG LPVVAARAGG
AIDLVRHGHN GYLFTPGVVT DLRARLRELL ANDSRRITQG LAGRAAAERR SWPSVMDELM
GYYTRAMSHR RLGRQPG