Gene Cagg_0802 details


Gene Information

Locus tag: Cagg_0802
Symbol:
ID: 7268121
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 996622
End bp: 998034
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 52%
IMG OID: 643565653
Product: O-antigen polymerase
Protein accession: YP_002462162
Protein GI: 219847729
COG category:
COG ID:
TIGRFAM ID:
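
The start, end, and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 996622
end_bp = 998034

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # gene length in base pairs
print(protein_length, "aa")   # protein length in amino acids
```

Under these assumptions the computed values agree with the listed Gene Length (1413 bp) and Protein Length (470 aa).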


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTTCTT TGCTGTACGC TTTTGAAGCA CCACCGTGGC TGCGACGACT GCCACAACCG 
CTGCTTGTGG GCGGAGTGAT CGGTTGGCTA GCAGTGTATA GCCTCGCGAT CGGGGTGATG
TTCGGCAGCA ATCGCACGTT GATCGGATTG GCCCTATTGG CATTACCGTT TGGGCTGATT
GGTCTTACCT TACTGCTCTA CCGATTCGAG TGGTTTGTGT TGATCTTGCC ATTGACCGCA
CTTGCGATGC GACCGGTGGC GTTGCCGGCC GGTAATAATA GCCATTTGCC GATCAGTATG
CTGCTTACCC TCGCGTTGTG CGGCATTTGG GTGTTGGCAA TGATCAAACG ACGCACGTGG
CAACTAACAC CTTCACCGCT CAACAAGCCA CTGCTCGCCT TGATGGGCTG TTTCATCTTC
TCGACCATCT GGGGCACAAT CTGGGCCGAT CCGATCCTTG ATTGGTGGAT TATGGGCAAT
TTTCGCTTGG CCCAATTCGC TTCACTGCTC TCTTTTCTTG GCTTGCTCGC CACACCGTTA
CTGATTGGGC GTTTCATTCG GTTTAAATGG CAGATCAAAG CCTATCTGGC AATGTTCATC
ATTTGCGGCA GCCTGATGAC TGTCGCTCAG ACGTTCGGTA TCGATCAAAT TATGTTAAAC
GATGCCGGCC TATGGGGGCT TTGGTTTGCG CTTCCACTTG CCGGAGTCAC CTATCTCCAA
CCACGGCTAC ACTGGCGATG GCGGTTGGCG GGTAGTGTAC TACTCCTCTG GCACTTGTGG
CTGGCTGCCA TTCGCAATTC ACTTTGGATT TCGGGTTGGC TACCAACTAT CATCGGCCTT
GTTGTTATGA CCTTTCTTAT ATCACGACGT ATCTTTTTCG TTCTCGTCCT TATTATTGCT
ATCAATCTGG CGATTGGGCC TGGCAGACAC TACATCGACC AAGTGGTCAA CGAGAACATT
GAAGAGGGAG GGTTGGGTCG GCTCGAAATC TGGCAACGCA ATCTCTCGAT CGTCCAGCAA
CACTGGCTTT TTGGGATGGG AGTTGCCGGG TATGCACCGT ACAACATGAC CTATTTTCGT
TACGATGCTC GTTCGACCCA CAACAATTAC TTCGATATTC TGGCTCAATT TGGTGTCATC
GGCTTTGGCC TCTGGCTCTG GTTCACCATT GTTAGTATCC GGTACGGTTG GCGTACCATT
GCGCTTGCAC CACCGGGCAT TTTACACACC ACCGCCATTG TGGCCATCGC CGGTTGGATA
GCAGCTCAGT TCTCGATGAT GCTCGGTGAT TGGATTTTAC CGTTTCTCTA CAACCAGACC
GTCGCCGGTT ATGCATATAC CGTCTATAGC TGGATATTCC TCGGCTTACT GATTAGTGTG
CGACAGTTGG TGCAGAAAGA GCCATTGTCA TGA
 
Protein sequence
MRSLLYAFEA PPWLRRLPQP LLVGGVIGWL AVYSLAIGVM FGSNRTLIGL ALLALPFGLI 
GLTLLLYRFE WFVLILPLTA LAMRPVALPA GNNSHLPISM LLTLALCGIW VLAMIKRRTW
QLTPSPLNKP LLALMGCFIF STIWGTIWAD PILDWWIMGN FRLAQFASLL SFLGLLATPL
LIGRFIRFKW QIKAYLAMFI ICGSLMTVAQ TFGIDQIMLN DAGLWGLWFA LPLAGVTYLQ
PRLHWRWRLA GSVLLLWHLW LAAIRNSLWI SGWLPTIIGL VVMTFLISRR IFFVLVLIIA
INLAIGPGRH YIDQVVNENI EEGGLGRLEI WQRNLSIVQQ HWLFGMGVAG YAPYNMTYFR
YDARSTHNNY FDILAQFGVI GFGLWLWFTI VSIRYGWRTI ALAPPGILHT TAIVAIAGWI
AAQFSMMLGD WILPFLYNQT VAGYAYTVYS WIFLGLLISV RQLVQKEPLS
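
The protein sequence corresponds to the gene sequence translated codon by codon. A minimal sketch of that translation follows, assuming the standard codon-to-amino-acid assignments (translation table 11 differs from the standard table in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The names `CODON_TABLE` and `translate` are illustrative helpers, not part of the record, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGCGTTCTT TGCTGTACGC TTTTGAAGCA CCACCGTGGC TGCGACGACT GCCACAACCG"
last_line = "CGACAGTTGG TGCAGAAAGA GCCATTGTCA TGA"

print(translate(first_line))  # MRSLLYAFEAPPWLRRLPQP
print(translate(last_line))   # RQLVQKEPLS
```

Both outputs match the corresponding ends of the protein sequence listed above. The last line can be translated on its own because the 1380 bases preceding it are a multiple of three, so it begins in frame.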