Gene Cagg_1177 details

Gene Information

Locus tag: Cagg_1177
Symbol:
ID: 7267926
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1452349
End bp: 1453497
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 56%
IMG OID: 643566020
Product: glycosyl transferase group 1
Protein accession: YP_002462522
Protein GI: 219848089
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
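
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 1452349
END_BP = 1453497


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1149
print(protein_length(length_bp))  # 382
```

Both values agree with the Gene Length and Protein Length fields of the record.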


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0229263
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGTCG CTTTAGTTCA CGATTATCTT AATCAATATG GTGGCGCCGA GCGGGTGCTC 
GAGGCGCTTC ACGAGCTGTT TCCGACGGCG CCGATCTATA CCTCAATCTT CGACCCCACG
GCAATGCCAG TGGTGTACCG GCAATGGGAC ATTCGCACCT CATTTATGCA ACGCCTGCCG
GCATGGCGTA CCCAATTCCG CCGTTATGTT GCCCTGTATC CTACGGCATT CGAGCAGTTT
GATCTGAGCA GCTACGACCT GATCATCAGT AGTTCGAGTG CTTTTGCCAA GGGCATTATC
CCACGTCCGG GCGCGTTACA TATCTGCTAC TGCCATACAC CGATGCGTTT CGCGTGGCGC
ACCGATGATT ACGTAGCCCG CGAGCAGATT AACGGTCTGC AGGCTAAGCT CTTACCGTTT
TTACTCAATT ATCTCCGCAT CTGGGATACG GTCAGTGCTA ATCGGGTTGA TCTGTTTGTT
GCTAACTCCC GTGAGGTCGC CGGACGGATT GCACGGTACT ATCGCCGGCC GGCGATGGTG
ATTCCCCCAC CGGTCGATCT TCCATCCTAT GCACCACGCC AACCCGAAGA GTTCTATCTG
GCCGGTGGGC GATTGATCCC GTACAAGCGG CTCGAATTAG CAATCGAAGC GTTTAACCAT
CTTCGCTTAC CCTTGAAGAT TTTCGGCGAT GGACGTGACC GTGCTCGCCT TGAACGTATG
GCCGGTCCCA ATATTGAGTT TCTGGGGTGG GTTGATGAAG CGACTCGTCT CGATCTCTTC
GCTCGCTGCC GGGCCTTTAT CTTCCCAGGT GAAGAAGATT TTGGCATTAC TCCGCTCGAA
GTGCTGGCTA TGGGCCGACC GGTCATCGCC TATGCCGCCG GCGGTGCTCT CGAAACGTTG
ATCGACGGTG TGACCGGCCG GTTTTTCTAT CAACCTACCG CCGCAGCTCT CGCCACCGCT
GTTGCCCTCT CACGTACCGA CTATATTGAT CCACTTGTGC TGCGTCGCCA CGCCGAACAG
TTTAGCCGTC CTCGTTTTCT CGCTGCGATG CGCAACTTGA TCGACGAGGC ACTTACTGCC
CAGCACACCG GCCGCCTCGC CGAATTTGAA CAAAGCTTCG CCCAATTGTC TCTGCCGGTA
TCACGATAA
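
The GC content field of the record can in principle be recomputed from the gene sequence. The sketch below assumes GC content means the percentage of G and C bases over all bases, and that the 10-base blocks are separated only by whitespace; `gc_content` is a hypothetical helper, and the example runs on just the first two blocks of the sequence rather than the full gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring whitespace."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First two 10-base blocks of the gene sequence above.
first_blocks = "ATGCGGGTCG CTTTAGTTCA"
print(gc_content(first_blocks))  # 50.0
```

Passing the full gene sequence to the same function gives the value to compare against the reported GC content.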
 
Protein sequence
MRVALVHDYL NQYGGAERVL EALHELFPTA PIYTSIFDPT AMPVVYRQWD IRTSFMQRLP 
AWRTQFRRYV ALYPTAFEQF DLSSYDLIIS SSSAFAKGII PRPGALHICY CHTPMRFAWR
TDDYVAREQI NGLQAKLLPF LLNYLRIWDT VSANRVDLFV ANSREVAGRI ARYYRRPAMV
IPPPVDLPSY APRQPEEFYL AGGRLIPYKR LELAIEAFNH LRLPLKIFGD GRDRARLERM
AGPNIEFLGW VDEATRLDLF ARCRAFIFPG EEDFGITPLE VLAMGRPVIA YAAGGALETL
IDGVTGRFFY QPTAAALATA VALSRTDYID PLVLRRHAEQ FSRPRFLAAM RNLIDEALTA
QHTGRLAEFE QSFAQLSLPV SR
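
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation follows, assuming the usual NCBI layout in which codons are ordered over the bases T, C, A, G and table 11 shares the standard amino-acid assignments. It does not handle the alternative start codons that table 11 allows, which is sufficient here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    residues = []
    for pos in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGCGGGTCG CTTTAGTTCA CGATTATCTT"))  # MRVALVHDYL
# Last 9 bases, including the TAA stop codon.
print(translate("TCACGATAA"))                         # SR
```

The two outputs match the first ten and last two residues of the protein sequence in the record.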