Gene Cagg_2072 details


Gene Information

Locus tag: Cagg_2072
Symbol:
ID: 7269231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2536189
End bp: 2537430
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 52%
IMG OID: 643566907
Product: hypothetical protein
Protein accession: YP_002463396
Protein GI: 219848963
COG category:
COG ID:
TIGRFAM ID:
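
The Start bp, End bp, Gene Length and Protein Length fields are consistent with one another. The short Python sketch below shows the arithmetic. It assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; neither convention is stated in this record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2536189
end_bp = 2537430

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1242 bp
print(protein_length, "aa")   # 413 aa
```

Both results match the Gene Length and Protein Length values listed above.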


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGATGGG GGGTCGTTAT CCTACTTATG GTTGGTTTGC TCAGTAGTGT AGCCATCACA 
CAACCTGTAC AAGCAGCAAT TACCAATATC GGGGCCGGTG GCTGCCCGGT GAATACTATC
GTGGGCGGCG ATAATGTGGT CCAGAACGGC AATTTTGCCC AGGGTGCGGT CGGCTTCACC
TCACAACTGA TCAATCGCGG TGACGGTGTC TATCCTGATG ACAACAACGG TGGCGGTTTT
TCGATTCAGA TTGGAACGAT AGTGTATCCT CCATTTGAAA CTAACCCCTA TATCTACGGA
CGGTCATTTC CCGGTGATCC ACAACGTGAT GTACCGCCTA CCGACACCTA TTTTTATTCA
AACCCAAGTG CAGCCAACTA TCAGGCAGGT AATGGCCGCG TCAACTTGTG GACGCAAACG
GTAGCAGTTG CTCCCAACAC GACGTACAAC TTCTTTGCCT ATTTCGACAA TCTCCTCGAC
CCGGTGAAGA GTGCGAACGG TGCTGCCGAT CCGATCATCG AATTGCGGGT CAACAATACC
TCAGTTGGCA CGACCGTGAT CCCCAAAACG CCAGATCGTT GGGTGCCGGT GCAGTATGCC
TTTACGACCG GTGATAATGT CACGAGTATC ACTTTGAAAA TCGATAGCCT CACCAATAAC
ACATTTGGTG ACGATTTTGC AATGACGCAG ATCAATCTCA AACAATGCGT CAGTGGGGTG
GGCGTCGCCA AATTTGCCTT CCCGCCCGAA GCTGCTACCC ACAACGGTGT AGAAGGGTTT
CGGCTTGAGT ATTGGATCAC CATCCGGAAC TTAGGTGCTG ATCCGGTGAC CAATCTGGCC
GCTATCGATG ATCTGGCTAC CGTTTTTGCG GCTGCTGAAG ATTGGGATGT GCTCGAACTC
AGTGCCATTA ATGAGAGTGG GTTTACCGTG TTGACGGTAA ATCCCGCGTT CGATGGAAGT
AGTGATCGCA ATCTTCTCGC TACCAATCAG AGTCTTGGTC CTGGTCAAAG CGCACGGATA
CGGTTGGTTG TGTGGGTCAA CCCCCCGGAA GGTCCGACCG TATTTACCAA CAGCGTCCAA
CTATCGGCGC TGTCAGGGAA CGTAGTAGTC ACCGATCTAT CAATGCCCGG TCTCAATCCC
GATCCCAACG GTAATGGTGA CCCCAAAGAA GACGGCGAGA TCGGAGTTAC CGTCTCAATC
TTCTCACCGT ACCAAACATG GGTACCGATA GTGACCCGCT AG
 
Protein sequence
MRWGVVILLM VGLLSSVAIT QPVQAAITNI GAGGCPVNTI VGGDNVVQNG NFAQGAVGFT 
SQLINRGDGV YPDDNNGGGF SIQIGTIVYP PFETNPYIYG RSFPGDPQRD VPPTDTYFYS
NPSAANYQAG NGRVNLWTQT VAVAPNTTYN FFAYFDNLLD PVKSANGAAD PIIELRVNNT
SVGTTVIPKT PDRWVPVQYA FTTGDNVTSI TLKIDSLTNN TFGDDFAMTQ INLKQCVSGV
GVAKFAFPPE AATHNGVEGF RLEYWITIRN LGADPVTNLA AIDDLATVFA AAEDWDVLEL
SAINESGFTV LTVNPAFDGS SDRNLLATNQ SLGPGQSARI RLVVWVNPPE GPTVFTNSVQ
LSALSGNVVV TDLSMPGLNP DPNGNGDPKE DGEIGVTVSI FSPYQTWVPI VTR
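
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence with translation table 11, the table named in this record. The Python sketch below is one way to do that and is not the database's own procedure. It uses the compact codon-to-amino-acid string for that table, whose amino acid assignments are the same as in the standard code. The helper names `clean`, `translate` and `gc_percent` are hypothetical, and only the first 30 and last 42 bases of the gene are used, to keep the example short.

```python
from itertools import product

# Codons enumerated in T, C, A, G order (first base varies slowest), paired
# with the amino acid string for translation table 11 ('*' marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring any trailing partial codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Excerpts copied from the gene sequence above (both start on a codon boundary).
first_30 = "ATGCGATGGG GGGTCGTTAT CCTACTTATG"
last_42 = "TTCTCACCGT ACCAAACATG GGTACCGATA GTGACCCGCT AG"

print(translate(first_30))  # MRWGVVILLM
print(translate(last_42))   # FSPYQTWVPIVTR*
```

The two translations agree with the start and end of the protein sequence above; the trailing `*` is the TAG stop codon, which is not part of the 413 aa protein. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content field.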