Gene Cagg_1401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1401 
Symbol 
ID7267253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1726735 
End bp1728015 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content58% 
IMG OID643566244 
Productglycosyl transferase group 1 
Protein accessionYP_002462744 
Protein GI219848311 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.52181 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCTA CCCATCCACA AACAGGTCAC CCGATCCGGG TGGCGATGGT GGCTCGCGCA 
GTTTTTCCCC TTCACGGATA TGGCGGCATT GAACGCCACG TGTATCACCT CACCAAATAT
CTGACCCAGC TCGGAGTTGT CGTCACGCTA TACGTCCAAT CACTACCCCC TAACGCCACA
CCGGGTGACC TGCGACCACA CGCGATTGAA ACGTTACGCT ACGACTATAC CTCACCTATC
CTCCCACCAA ATGGTGTGAT CGGTCGACAG ATCAACTTTC CGATCTACAG TTGGCGCATT
GGCCGACGGG TTGCCCGGCA CGTACTGCAT AGTAATTTCG ACGTGGTACA TACTCAAGGG
TTGTGCGCTT TTGGCTACGC CGCCGCCCGC CACCGCGATC CCCGTTTACA GACCACACCG
TTCGTTGCCA ACCCACACGG TATGGAAGAA TTCCGTACCT CCGACCGCAT CAAATGGCTG
GCTTATGCAC CGTTTCGCTT CCTTTATGCC TACGGCCATC GTCAGGCCGA CCGCGCCATC
GCAACCGATT CATGCACCAA AGACGATTTG CCGCGCTACC TTGGCGTCGA TCCGGCACGA
GTTGTGGTCA TCCCATCAGC TATCGACGTT GCCGAATGTT TATCTCAGGT ACGCAGCGAA
CTTCGTACCG CCCTGCGTTT CCGTCTCGGC CTTACCAGCG GTAACCCGAT CTTGCTCAGC
GTTGGACGAC TCGAACCCAA TAAGGGGTTT GATGTGCTTA TTGCCGCCCT GGCCCGCTTA
CGGGACGAAT TGCCGCCCAA CTGGCGCTGG CTGCTGGTCG GCAGTGGATC GGCCCGCACC
GCGCTCGAAC AGCAAATCCG TGAAGCAGGG ATCGCCGAAC ACACCGTACT GGTCGGGCGC
CTGAGTGACG AAGAACTGCA TAGTCTGTAT GAAGAGATCG ACCTCTTCGT TCACCCAACC
CGCTATGAAG GCAGTTCGCT GGTGACGCTT GAGGCGATGA TCCATCGCCG GCCGGTGGTC
GCCAGCGCCA TCGGCGGTAT TCCCGATAAG GTCTTTCCGG GACGTAACGG CTTGTTGGTC
AAACCGGGTG ATGTTGATGA TCTCACCGCC CAACTCCGCG CAGCTTTAGC CGCCCGCGCA
CAGTGGTCAG AATGGGGGGC AGAGAGTGAG CGCATTGTGC GCTCAACATT CGATTGGCCG
GTCGTGGCTC AACAGACGCT CGCCCTCTAT CACGAGTTGC TGGCCGGCAA CCATACCTCA
CACCCCACCC GATCGGGGTG A
 
Protein sequence
MDATHPQTGH PIRVAMVARA VFPLHGYGGI ERHVYHLTKY LTQLGVVVTL YVQSLPPNAT 
PGDLRPHAIE TLRYDYTSPI LPPNGVIGRQ INFPIYSWRI GRRVARHVLH SNFDVVHTQG
LCAFGYAAAR HRDPRLQTTP FVANPHGMEE FRTSDRIKWL AYAPFRFLYA YGHRQADRAI
ATDSCTKDDL PRYLGVDPAR VVVIPSAIDV AECLSQVRSE LRTALRFRLG LTSGNPILLS
VGRLEPNKGF DVLIAALARL RDELPPNWRW LLVGSGSART ALEQQIREAG IAEHTVLVGR
LSDEELHSLY EEIDLFVHPT RYEGSSLVTL EAMIHRRPVV ASAIGGIPDK VFPGRNGLLV
KPGDVDDLTA QLRAALAARA QWSEWGAESE RIVRSTFDWP VVAQQTLALY HELLAGNHTS
HPTRSG