Gene Cagg_1399 details


Gene Information

Locus tag: Cagg_1399
Symbol:
ID: 7267251
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1725075
End bp: 1726427
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 54%
IMG OID: 643566242
Product: TPR repeat-containing protein
Protein accession: YP_002462742
Protein GI: 219848309
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2956] Predicted N-acetylglucosaminyl transferase
TIGRFAM ID:
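
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon while the protein length does not. The function name `check_cds_lengths` is hypothetical and not part of any database API.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the coordinates and lengths of a CDS record agree.

    Assumptions (not stated in the record itself):
    - coordinates are 1-based and inclusive;
    - the gene length includes the stop codon, the protein length does not.
    """
    span = end_bp - start_bp + 1           # inclusive coordinate span
    whole_codons = gene_length_bp % 3 == 0  # a CDS should be a multiple of 3
    codons = gene_length_bp // 3            # includes the stop codon
    return (
        span == gene_length_bp
        and whole_codons
        and codons - 1 == protein_length_aa
    )


# Values copied from the Gene Information fields above.
print(check_cds_lengths(1725075, 1726427, 1353, 450))
```

For this record the span is 1726427 - 1725075 + 1 = 1353 bp, which is 451 codons: 450 amino acids plus one stop codon.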


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCTTG ACAGCCTGTA TCAAGATCGA CGTGAAGCAG CACTGGCCGA TTTCAATCGA 
GCCATCGCCA TCGATCCAGA TTATGCATGG GCCTACTTCC AACGCGGACA AGTCTTGCGC
GAACTAGGGA GAATGGAGGA ATCATTGGCC GATCTTCGTC GAGCGTGTGA GCTAGAACCG
AACGATGCAG CATACCACGC CGAACGAGGC GAAACGCTGC GCCTAATGCG CCGTTATGAA
GAGGCTCTTA CCGCCTTCTC ACGCGCCATT GAACTACGAC CTGAATATCC GTGGGCATTA
GGAAGCCGCG GACAAGTATG GCGCGCACTG CGCCGTCATC ACGAAGCACT TGCCGATTTC
GAGGCAGCAT TGGCACTGAA CCCAATGCTG GCATGGGTGC ATGCCGAACG TGGCGAAACC
TTGCGGGCAT TACGTCGCCT CCATGAAGCA ATTGAAGCCT TCAACCAGGC GTTAAACATC
GATCCGGACT ATGTGTGGGC GTTAGGTCAT CGCGGCATTG CCTACCGTGA ACTCCGTGAC
TATCCGGCAG CCATCGCCGA TTTCGATGCG GCCATTACTT TACAAGATAC TATCGCGTGG
CTGTACGCCG AGCGCGGTGA AACACGTCGC CTGGCCGATG ATTTCGAGGG CGCCTTGTTT
GATCTCAACC GGGCAATCGA ACTTGACCCC CAATATGCAT GGGCATTAGG GAGTCGCGGT
GCCACTTTCC GTGCGCTCGG TGATACCGAA GCGGCATTGG CCGATTTCAA TCGTGCGCTT
GAGCTTGATC CAGCTTATGA ATGGGTATAT ATGCAGCGCG GTCTCTTGTA CCGCAACCTC
GACCGACTCG ATGAAGCACT CGCCGACTTT AGCCGTGTCC TCGCACTCAA CCCCAATAAC
GTCGGTGCGC TCGTCGAACG AGGTGAGTTG CTCCGTCTGC GCCGTCACTA CAACGAAGCG
CTGACCGATT TCAGTCATGC GATTGAGCTT CAACCCGATC ATGCATGGGC AATCGGCAGC
CGAGGGCAAG TATATCGGGC ACTCAATCGT TATCACGAGG CGCTCGCCGA TTTCAACAAT
GCACTTGAGC AAAAACCCGA CCTGGTATGG ATCCTGGCCG AACGGGGTGA AACCTACCGC
TTGCTGCACC AGTACAACGA AGCGTTAGAA GATTTCAACC GCGCCCTCGA ATTGCAACCA
AATGATTCGT GGGTACTCAG CCGGCGTGGT GCTACCTATC AGGCACTAAA GATATTTGAA
GAGGCTTATA TTGATCTGAC CCGCGCCATT GAAATCGATC CGAATAATGC ATGGGCTTTG
GCACAACGTG GCTCACTCTT CCGACAGATG TAA
 
Protein sequence
MNLDSLYQDR REAALADFNR AIAIDPDYAW AYFQRGQVLR ELGRMEESLA DLRRACELEP 
NDAAYHAERG ETLRLMRRYE EALTAFSRAI ELRPEYPWAL GSRGQVWRAL RRHHEALADF
EAALALNPML AWVHAERGET LRALRRLHEA IEAFNQALNI DPDYVWALGH RGIAYRELRD
YPAAIADFDA AITLQDTIAW LYAERGETRR LADDFEGALF DLNRAIELDP QYAWALGSRG
ATFRALGDTE AALADFNRAL ELDPAYEWVY MQRGLLYRNL DRLDEALADF SRVLALNPNN
VGALVERGEL LRLRRHYNEA LTDFSHAIEL QPDHAWAIGS RGQVYRALNR YHEALADFNN
ALEQKPDLVW ILAERGETYR LLHQYNEALE DFNRALELQP NDSWVLSRRG ATYQALKIFE
EAYIDLTRAI EIDPNNAWAL AQRGSLFRQM
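
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs in which codons may act as start codons, and this sketch does not model alternative starts). The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Whitespace is stripped first because the page prints sequences in 10-character groups.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence as printed above.
first_line = "ATGAATCTTG ACAGCCTGTA TCAAGATCGA CGTGAAGCAG CACTGGCCGA TTTCAATCGA"
print(translate(first_line))  # MNLDSLYQDRREAALADFNR
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.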