Gene Cagg_0966 details


Gene Information

Locus tag: Cagg_0966
Symbol:
ID: 7268039
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1193863
End bp: 1195011
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 56%
IMG OID: 643565814
Product: glycosyl transferase group 1
Protein accession: YP_002462320
Protein GI: 219847887
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
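
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1193863
end_bp = 1195011
gene_length_bp = 1149
protein_length_aa = 382

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the unspliced CDS ends in one stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop matches protein length:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree: 1149 bp is 383 codons, and 383 codons minus the stop codon gives 382 aa.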


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0155699
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.836168
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAATTT TGTGTGCCTT AACGTACTAT CGTCCCTATA CGAGCGGCTT AACGATCTAT 
GTCGAACGGC TGACTCGTGG GCTTGTTCGG CGGGGGCATA CCGTCACCGT ACTCACCTCA
CAGTACGATC CGAGTTTGCC AAAGATCGAG TTTCTTGATG GAGTACGGAT CGTGCGGGCA
CCGGTAGTCG CTCGGGTGAG CAAAGGGGTG ATTATGCCGA CGTTCGGTTG GTTAGCGACC
CGCCTTTCAC TTGAACACGA CGCAATGAGC TTGCACCTTC CGCAATTTGA TGCACCAGGG
CTGGCGCTGC GTGGCAAGGT GCTCAAACAA CCGGTTGTGT TGACGTACCA TAGCGATCTC
AAGCTTCCAC CGGGCTTGCT CAATCGGGTG GCGAATCAGG TTGTCGCGAT TGCTAACCAG
GCAGCAGCAG CATTGGCAAC CCGAATCGTC GCCTATACTC AAGATTTTGC CGACCATTCA
CCCTATCTCC GGCAGTGGCG TCGGAAGGTG ACAATCATTC CACCACCGGT CGAAGTGGCA
CAGGTATCGG ACGATGAAAT TGCAGCGTTT CGACGGCGCT GGAATTTGCA AGGACCGGTG
ATCGGTATGG CTGCTCGGCT TGCTGCCGAA AAAGGGGTCG AGGTACTCTT AGCTGCGCTA
CCGCGCATCC TTGCCATCTA TCCCACGGCT CGGGTACTCT TTGCCGGCCA GCATGAGAAT
GTCTTGGGCG AAGAGGCCTA TGCACGCCGC CTGGCCCCGC TGTTGGAGCA GTTTCGTGAC
CATTGGACGT TCCTTGGTAC GCTGAATCCG CGCGAGATGG CGGCATTCTT TCCCAATCTT
GACGTGCTGG TGGTACCGTC ACTCAATTCG ACCGAGACGT TTGGATTGGT GCAAGTCGAA
GCAATGCTGT GCGGGACACC GACGGTCGCC AGTAATTTGC CCGGTGTGCG TCAACCACCG
TTGATGACCG GCATGGGGAA GGTGGTGCCG ATCGGTGATG CGAACGCATT GGCCGAAGCG
ATCCTCGAAA TAATTGCCCA TCGGTCGCAG TATGTACGTC CCCGTGAAAC GGTAGCAGCT
TTGTTTAGCA CCGAACGCAC GGCAGAGATG TACGAGCAAC TCTTTCGCTC ACTCGGTGTC
GCCGATTGA
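
A GC content figure such as the one listed in the Gene Information section can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch under the assumption that GC content means the percentage of G and C bases over all bases; `gc_percent` is a hypothetical helper name, and the rounding used by the source database is not known.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    # Remove the spaces and line breaks used to lay the sequence out in blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage on the first 10-base block of the gene sequence only;
# paste the full sequence in its place to check the whole gene.
first_block = "ATGCGAATTT"
print(gc_percent(first_block))
```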
 
Protein sequence
MRILCALTYY RPYTSGLTIY VERLTRGLVR RGHTVTVLTS QYDPSLPKIE FLDGVRIVRA 
PVVARVSKGV IMPTFGWLAT RLSLEHDAMS LHLPQFDAPG LALRGKVLKQ PVVLTYHSDL
KLPPGLLNRV ANQVVAIANQ AAAALATRIV AYTQDFADHS PYLRQWRRKV TIIPPPVEVA
QVSDDEIAAF RRRWNLQGPV IGMAARLAAE KGVEVLLAAL PRILAIYPTA RVLFAGQHEN
VLGEEAYARR LAPLLEQFRD HWTFLGTLNP REMAAFFPNL DVLVVPSLNS TETFGLVQVE
AMLCGTPTVA SNLPGVRQPP LMTGMGKVVP IGDANALAEA ILEIIAHRSQ YVRPRETVAA
LFSTERTAEM YEQLFRSLGV AD
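
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the method used to produce this record. It assumes the gene sequence is given in the coding orientation, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons). Alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. `CODON_TABLE` and `translate` are hypothetical names.

```python
# Codon table built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to lay the sequence out in blocks.
    bases = "".join(dna.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Usage on the first two 10-base blocks and on the final block of the gene sequence.
print(translate("ATGCGAATTT TGTGTGCCTT"))
print(translate("GCCGATTGA"))
```

Translating the full gene sequence the same way and comparing the result with the protein sequence, after removing spaces from both, gives a direct consistency check between the two listings.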