Gene Cagg_1964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1964 
Symbol 
ID7268880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2399438 
End bp2400595 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content51% 
IMG OID643566801 
Productglycosyl transferase group 1 
Protein accessionYP_002463294 
Protein GI219848861 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0014923 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCATCGTA GGATTCTCAT CTTGGCCAGC TGGTATCCAA ACGCGACGTC GCCAGTGGGT 
GGCGTATTCA TCAAAGAACA GGCGCAAATT CTGGCAGAAC ACTTTGAGGT AACTGTTCTG
GCAGTACAGA CGACAGGATT GCGCGCTACA TTACGCAATC ACTCTTACCA TGGTATCGAG
CGCGACGGAA ATCTCACCAT CTATCGTGTA TATGCACCAA CCCTGCCGTA TGTGCCTCAC
CTGAGTCTCC TGAGCAGCGC CTATTTTGCC TGGCAGGGAT TGCGACAATA TATTCACCAC
CACGGTCGTC CTGATCTTAT TCACGCCCAC GTTGTGCTAC CGTGTGGTTG GATCGCGGCA
CGCGCTGCGC AAGCGTGGGG TGTTCCGGCA ATTCTGACAG AACATACCAG TCCTTTTACC
GTTCATCTAT ACACGAGATT GCAGCGGTAT TTGGTGCGTG AGACAATAAT ACATCTTCCC
GTGTTGGCAA TTAGTCCGTC GCTCAGACAG CGCATTCTTG AATTTGTGCC AACTACCGAT
GTGCGAGTGC TGGGTGAAGT CATCAAAACG CGCTTCTTTA CACCCTCGGA AACAGATGCC
GAACCCACTC AATCCAAAAA GCGATTTCTA ACCGTTGCTC TTTTGACCGA ACAAAAGGGG
GTCGATCATC TTTTACAAGC TGCTGCCTTA CTCCGTCAAC AGATTGATTG CCCGTTCGAG
CTGGTCATCG GCGGCGATGG GCCAGCGCGC CCACGATTGG AACAGTTGGC TCGCCAATTC
GGTCTGAAAG ATATCTGTCG TTTTGTCGGT CTCCTCAATC GGACACAGGT GCGTGATTGG
ATGCGCTGGT GCGATGTGTT CATTCTGCCG AGCATTCACG AAACGTTTGG TGTCGTACTA
GGTGAAGCAA TGGCATGCGG AAAACCGGTC ATTGCCACGC GTTGTGGCGG ACCGGAGTTT
GTCGTTGAAG ATGGATGCGG TTTGCTCGTG CCCATTGCCG ACCCATATGC GCTTGCTGAC
GCAATGAAGC AGTTTTTGCA GGACCGGGTG CAGTACGATC CATCTCTTAT TCGAGAAAGC
GTTTGTCAAC GTTTTGGCGA AGAAGCGTTT TTACGAAACA TTGAGACAAT CTACAACGAA
ATATGGTCAA AATCATGA
 
Protein sequence
MHRRILILAS WYPNATSPVG GVFIKEQAQI LAEHFEVTVL AVQTTGLRAT LRNHSYHGIE 
RDGNLTIYRV YAPTLPYVPH LSLLSSAYFA WQGLRQYIHH HGRPDLIHAH VVLPCGWIAA
RAAQAWGVPA ILTEHTSPFT VHLYTRLQRY LVRETIIHLP VLAISPSLRQ RILEFVPTTD
VRVLGEVIKT RFFTPSETDA EPTQSKKRFL TVALLTEQKG VDHLLQAAAL LRQQIDCPFE
LVIGGDGPAR PRLEQLARQF GLKDICRFVG LLNRTQVRDW MRWCDVFILP SIHETFGVVL
GEAMACGKPV IATRCGGPEF VVEDGCGLLV PIADPYALAD AMKQFLQDRV QYDPSLIRES
VCQRFGEEAF LRNIETIYNE IWSKS