Gene Cagg_2329 details

Gene Information

Locus tag: Cagg_2329
Symbol:
ID: 7268679
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2830433
End bp: 2831692
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 61%
IMG OID: 643567159
Product: glycosyl transferase group 1
Protein accession: YP_002463644
Protein GI: 219849211
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
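
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the gene is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 2830433
end_bp = 2831692

# Inclusive span on the replicon.
gene_length = end_bp - start_bp + 1

# Number of complete codons in the gene.
codons = gene_length // 3

# Assumption: the last codon is a stop codon and yields no residue.
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 1260 419
```

Under these assumptions the result agrees with the listed Gene Length (1260 bp) and Protein Length (419 aa).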


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0153585
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATATCC TGCACGTTGT CCAACTCTAC TGGCCTGCTC CTAGTGGGGC AGCCCGCTAC 
TTTCAAGAGA TCGGCGCCCG CCTCGTCGCC GAAGGTCATC GGGTAACGGT ACTGACGACC
GATGCCTTTG ACCTTGAACA CCTTTGGATG CCCGGCAAAC GACGGATCGC CGAGCCATTC
GGTGAACACG ATGGGGTGCA GATCAGACGG CTAGCGATCC GACGACTACC CGGTCCGGCG
CTTCTCTACC CGATCATCCG TCGGCTGATG GTCGAAGTGG GCCGCTTGGG CCGACCAAGC
GTGCCAATCT TGCGCCGGTT CGCCATGCTC ACACCGCAGA TCCCCGATCT CATCACGACC
CTCGCCGATC CGGCACTGTC CGATGTCGCC GTTGTCCACA CCACCAACAT CACCCTCGAC
TTTGCCCTGA TTCCGGTGGC ACGCTGGACC CAACAGCGCG GCCTACGCCA CATCTGCACA
CCATTCGTGC ATGTCGGTGA ACCGGGGAGT GAGCAAACTG TTCGCTACTA CGCGATGCCA
CACCAGATCG ATCTACTCCG ACGGGCAACG TATGTAGCGA CCATGACCGA GATTGAACGA
GCGTACCTGA TGCGGCGTGG TGTTCCCGCT GCTCAGATCG TCACCGTCGG TGCAGGGGTA
ACGCCCGTCG AAGTTACCGG TGGCGACGGA CGGCGTTTTC GGGCCACCCA GCAGATTAGT
GGGCCGATGG TGCTGAGTTT GGGTGTGGCT GCGTTCGACA AAGGCGCGAT CCATACGTTA
GCGGCGATGC GACGGTTATG GGCACAAGGA AGTGACGCAG TATGGGTGCA ATGTGGACCG
GCGTTTGGTG GTTTTGCCGA AGCAGTAGCG GCCCTGACAC CGTCCGAACG AACGCGCGTG
CGGATTCTTG GCTACGTTGA TGACGATACC CGCCGCGACG CACTCGCCGC TGCCGACGTG
TATGTACAGC CATCGCGCAC CGACAGCTTT GGTATCACCT ATCTCGAAGC GTGGTGCAAC
GGTGTGCCGG TGATCGGAGC GCGGGCCGGT GGTGTACCGG CAGTAATCCG GCACGGGGTT
GATGGCTGGT TGGTGCCATT TGGGAATGTA CCGGCGATTG CTGAGGCCAT CGACCGTCTC
CTGCGTGATC GAGCATTGGC ACGAGCAATG GGCGCTGCCG GACGAGCACG TGTTTGGCGT
GAACTGACGT GGGATGCCGT TTACCAACGA ATCCGCCCGC TATATGGCGA GCTTGGGTGA
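
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be joined before any computation. The following is a minimal sketch of how one might do that and then compute a GC fraction comparable to the GC content field of this record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here as a short demonstration.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCATATCC TGCACGTTGT CCAACTCTAC TGGCCTGCTC CTAGTGGGGC AGCCCGCTAC"
seq = clean_sequence(first_line)

# Length of the cleaned line and its GC fraction.
print(len(seq), round(gc_fraction(seq), 3))
```

Passing all sequence lines to `clean_sequence` at once gives the full gene, whose GC fraction can then be compared with the reported value.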
 
Protein sequence
MHILHVVQLY WPAPSGAARY FQEIGARLVA EGHRVTVLTT DAFDLEHLWM PGKRRIAEPF 
GEHDGVQIRR LAIRRLPGPA LLYPIIRRLM VEVGRLGRPS VPILRRFAML TPQIPDLITT
LADPALSDVA VVHTTNITLD FALIPVARWT QQRGLRHICT PFVHVGEPGS EQTVRYYAMP
HQIDLLRRAT YVATMTEIER AYLMRRGVPA AQIVTVGAGV TPVEVTGGDG RRFRATQQIS
GPMVLSLGVA AFDKGAIHTL AAMRRLWAQG SDAVWVQCGP AFGGFAEAVA ALTPSERTRV
RILGYVDDDT RRDALAAADV YVQPSRTDSF GITYLEAWCN GVPVIGARAG GVPAVIRHGV
DGWLVPFGNV PAIAEAIDRL LRDRALARAM GAAGRARVWR ELTWDAVYQR IRPLYGELG
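
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs mainly in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The function name `translate` and the table constants are illustrative, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence in this record.
print(translate("ATGCATATCC TGCACGTTGT C"))  # prints: MHILHVV
```

The output matches the first seven residues of the protein sequence above; translating the full gene sequence the same way should stop at the terminal TGA codon.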