Gene Cagg_3800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3800 
Symbol 
ID7267874 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4635653 
End bp4636861 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content57% 
IMG OID643568608 
Productglycosyl transferase group 1 
Protein accessionYP_002465072 
Protein GI219850639 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.17673 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.211423 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGTAC TACAGATTTC GTGGGAATAT CCCCCTCACA TCGTTGGCGG TTTAGGCCGC 
CATGTGGCAG ATTTAGTGCC GGCATTGATC GAACACGACG TAACGGTAAC CGTCATCACT
CCGCAATTAC GTGGCGGTGA ATCGTTCGAA CGTCAGGAAC AGATCACTAT CTGCCGGGTT
GCCATTCCGC CGATCGATAC CGATGACTTC CCCAGCTTTG TCTATCACGC TGGGTGGCAT
CTTGAACACG CAGCCCATGC CCTGTTTCCC GGTGGCCGAC CCGATCTGGT CCACGTTCAC
GACTGGCTGA CCGCCGAGGT TGGTATCACG CTCAAGCACC ATTGGCGTGT CCCGCTGATT
GCTACCATCC ATGCGACCGA ACGTGGGCGT GGACGTGGTG ATTTGAATGG GCACCGAGCA
CAGCACATTA ACGATCTCGA ATGGCGTTTA ACCTACGAGG CATGGCGGGT GATTGTTTGT
TCGCACTTTA TGGCACGCCA GATCCGTGAG TACTTTACCA CTCCGCCAGA CAAGATCGAT
GTTATTTCCA ATGGTGTTCA TATTCCGCCC CCACCGTTTG AAACACCGGC GGAATGGGTC
GCCTTTCGGC GCCGTTTTGC CGCCGATAAC GAAGCGTTAG TGATTTTTGT CGGACGGCTC
GTTTACGAAA AGGGTGTGCA TGTCTTACTT GAGGCCTGGC CACGGGTTAT TGCCGAAATA
CCGGCTCGCT TAGTGATCGC CGGCACCGGT AGTGCTTTCG ATGAACTCAA GTGGCGGGCC
GCAGAGTTGG GAGTGCCGGT TGAGTTTCTC GGCTATATCA GTGACGAAGA TCGGAACCGT
TTGTACGCCG TGGGTGATGT GGCCGTCTTC CCCTCGCTCT ACGAGCCGTT TGGGATTGTG
GCGCTGGAAG CCTTTGCGGC CGGTTGTCCG GTGATTGTCT CTGATGCCGG TGGGTTGGCC
GAGGTTGTCC AGCACGAACT GAACGGGTTA GTTGTCCCCG CCGGCAATGT GGTGGCGCTC
GCCAATGCGT TGCTGACCAG TCTGCACGCC CCCGCCGAGT CGCGGATGCG TGCGGCGCGT
GGGGCTGCGC TGGCCCGCGC TTACTACACC TGGAACCGAA TTGCCGGGGA GGTAAAAGCG
TTGTACGATC GGGTATGGAC GGAGTGGAAG GCCGGTGCAT GGGGGAAAGA GATTATGCGT
CGCGCCTGA
 
Protein sequence
MHVLQISWEY PPHIVGGLGR HVADLVPALI EHDVTVTVIT PQLRGGESFE RQEQITICRV 
AIPPIDTDDF PSFVYHAGWH LEHAAHALFP GGRPDLVHVH DWLTAEVGIT LKHHWRVPLI
ATIHATERGR GRGDLNGHRA QHINDLEWRL TYEAWRVIVC SHFMARQIRE YFTTPPDKID
VISNGVHIPP PPFETPAEWV AFRRRFAADN EALVIFVGRL VYEKGVHVLL EAWPRVIAEI
PARLVIAGTG SAFDELKWRA AELGVPVEFL GYISDEDRNR LYAVGDVAVF PSLYEPFGIV
ALEAFAAGCP VIVSDAGGLA EVVQHELNGL VVPAGNVVAL ANALLTSLHA PAESRMRAAR
GAALARAYYT WNRIAGEVKA LYDRVWTEWK AGAWGKEIMR RA