Gene Cagg_2468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2468 
Symbol 
ID7266192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2996966 
End bp2998126 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content59% 
IMG OID643567294 
Productglycosyl transferase group 1 
Protein accessionYP_002463776 
Protein GI219849343 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.822544 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAATTG TGATGATTGC GCCCTTCGGG ATCAGGCCGA AGGGTACGCT CTTGGCCCGC 
ATGTTGCCGT TGGCACAGGC ACTGCACCGA CGGGGCCATG CCGTGACTAT TGTGGCCCCG
CCGGTACATA ACCCTACCGA TGGCGGAACA CACCGCTCCT ACGATGGAGT GATCGTTATC
CACACGGCCA CACCACCGCT GAGTGGGCTA CCGGCGGCGC TTTGGCATAC GATTGCGCTC
TGGCAACAGA CACGCCGGCT CCGTCCCGAT GTGATTCATC TGTTCAAACC AAAAGGTTTT
GGTGGCCTGG CAGTGTTGGC ACGTGGTCGG GTGCCACTCG TTGTGGATTG CGACGACTGG
GAGGGGCCGG GTGGCTGGAA CGATCTCCTG CCGTATCCAC GACCGGCCAA ATTGCTCTTC
GCATGGCAAG AGCGCGATCT GCCGCGCCGC GCCGATGCAG TTACGGTTGT TTCCCATACC
CTCGAAACAT TGGTGTGGGC AATGGGCGTG CCACCACAGC GCGTTTTCTA CCTGCCTAAT
GGTGCGGTAC CGAACGAACC ACTCCCGCCG CGACTAACCG AACGACCAAC AATTGTGCTC
TATACCCGCT TCTGGGAACT GGATGTGGCC GAGGTGGCAA CGGTGCTGGC AACGATTCAC
CATGCTCGAC CGACAGCTCG TCTGTTGTTG ATCGGCAAAG GTGAACGCGG TGAAGAGCAG
CGTCTGTTAG CGCAAGCGGC AACCGAGGGA TGGCTCACGA TGATCGATTA TCGCGGCTGG
CAAGAACCTA CCGCTATCCC ATCGTTACTC GCTGAAGCCG ATGTAGCGCT AGCACCCATC
AGTGATACAT TGATCAACCG TGCCCGTGGA ATGGCCAAAT TGGTTGAACT CCTTGCCGCC
GGCTTACCGA TCGTCGCCAG TGATGTAGGT ACGGCACGTG ACTATCTTGC CCCAGATGCC
GGTATTCTCG TACCACCCGG CAATCCGCAC GCACTAGCGG CAGCCGTTAT CAATCTGCTC
GACGATGCAA CGGCACGAGC CAAACTCCGT ACCGCTGCTC TCGCTGCGGC CCATCGTCTC
CGTTGGGATA ACCTCGCCCT CATTGCAGAA ACCGCCTACC GCCAAACCGG TTTGTCAATA
AAATCCAATG CCATGGTATG A
 
Protein sequence
MRIVMIAPFG IRPKGTLLAR MLPLAQALHR RGHAVTIVAP PVHNPTDGGT HRSYDGVIVI 
HTATPPLSGL PAALWHTIAL WQQTRRLRPD VIHLFKPKGF GGLAVLARGR VPLVVDCDDW
EGPGGWNDLL PYPRPAKLLF AWQERDLPRR ADAVTVVSHT LETLVWAMGV PPQRVFYLPN
GAVPNEPLPP RLTERPTIVL YTRFWELDVA EVATVLATIH HARPTARLLL IGKGERGEEQ
RLLAQAATEG WLTMIDYRGW QEPTAIPSLL AEADVALAPI SDTLINRARG MAKLVELLAA
GLPIVASDVG TARDYLAPDA GILVPPGNPH ALAAAVINLL DDATARAKLR TAALAAAHRL
RWDNLALIAE TAYRQTGLSI KSNAMV