Gene Cagg_3592 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3592 
Symbol 
ID7269736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4369956 
End bp4370981 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content55% 
IMG OID643568400 
Productglycosyl transferase family 2 
Protein accessionYP_002464866 
Protein GI219850433 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000663913 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTGATG TACAACCAAC CTTCTCGGTC GTCGCGCCGG TGTACAACGA AGAGCAGTTG 
ATCGCCGAGT TTTGTCGGCG GGTTATTGCG GTACTTGAAC CGCTAGGGGA ACCGTTTGAG
CTGGTGTTAG TGAACGATGG CTGTCGCGAC CGCTCACCGG AGATTATGCG CGAGCTGCAC
GAGCGTGACC CGCGAATTAA GGTGATCAAT TTCTCGCGCA ATTTTGGTCA TCAGATCGCG
ATTACTGCCG GTACCGACTA CGCGACCGGT AAAGCAGTGA TTGTGATCGA TTCGGATTTG
CAAGACCCGC CTGAGGTGAT CCCCGCGCTG ATTGCGCGTT GGCGTGAAGG GTATCAGGTC
GTCTATGGTG TGCGCGAAGA GCGTGAAGGT GAGACGTGGT TTAAAAAGAC AACGGCGTCC
ATCTTCTATC GCTTGATCGT GCGGATTACC AATGTCAACA TCCCGGTCGA CACCGGTGAT
TTTCGTCTGA TGGATCGCAA AGTTGTCGAC GCTCTCAAGC GTATGCGCGA ACATCATCGC
TTTATGCGTG GGTTGTCGGC GTGGGTCGGT TTTCGTCAGA CCGGGGTGCC ATATCGTCGC
CATGCCCGTG CTGCCGGTAC CACCAAATAC CCGTTACGCA AGATGTTGCG TTTTGCCCTC
GATGGCATTA CCAGCTTCTC GTATTTGCCG CTGCAATTGG CAACCTATCT CGGTTTTGTG
GTCGCCGCAA TTAGTATGAT CTTCCTGCTG GTTGTGTTTG TTATGCGGCT AGCGAACCCC
GCGGCTGCCG AACCGGCGTT TTATGGGCAA GCCAGTACGC TGGCAAGCGT GCTCTTCCTC
GGCGCAGTGC AACTGATTTC GCTCGGCATC ATCGGCGAGT ATGTCGGTCG TATTTACGAT
GAGGTGAAAG GCCGGCCACT CTATATCGTC GCTGAAACGT TGGGTATCGC CGAGCCGGAT
GCAACTTCTG CCGCGATGGT ACGTACTTCA TCTACAGAGC ATGAGGTAAC AACGTCATCG
GGGTAA
 
Protein sequence
MSDVQPTFSV VAPVYNEEQL IAEFCRRVIA VLEPLGEPFE LVLVNDGCRD RSPEIMRELH 
ERDPRIKVIN FSRNFGHQIA ITAGTDYATG KAVIVIDSDL QDPPEVIPAL IARWREGYQV
VYGVREEREG ETWFKKTTAS IFYRLIVRIT NVNIPVDTGD FRLMDRKVVD ALKRMREHHR
FMRGLSAWVG FRQTGVPYRR HARAAGTTKY PLRKMLRFAL DGITSFSYLP LQLATYLGFV
VAAISMIFLL VVFVMRLANP AAAEPAFYGQ ASTLASVLFL GAVQLISLGI IGEYVGRIYD
EVKGRPLYIV AETLGIAEPD ATSAAMVRTS STEHEVTTSS G