Gene Cagg_1239 details


Gene Information

Locus tag: Cagg_1239
Symbol:
ID: 7266225
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1515474
End bp: 1516583
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 52%
IMG OID: 643566081
Product: hypothetical protein
Protein accession: YP_002462583
Protein GI: 219848150
COG category:
COG ID:
TIGRFAM ID:
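
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. Both are assumptions that happen to reproduce the values in this record; the function names are hypothetical.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Number of residues encoded by a CDS, not counting the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1515474, 1516583)
print(length)                     # 1110
print(protein_length_aa(length))  # 369
```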


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00785292
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAAA GTAAGACGGT ACCAACCCTC ATCAGCTTCG CGCTTGTTGT TGTGCTCACC 
GCCTGCGGCG GTACTGCCAG TGCCCCTGCT CCCACTCCCA CAACGGCAGA ATCGGCAAAC
CCAACACGCA CGCCTCGCCC AACAGCCGCT AATGCGCAAC CAACAGCCGC ACCAACCACT
GCCGCAGAGG GATCGATCAT CCGACCGCCG CCCACTAACA ACCAAAGCGG ACTGACCACA
CTCGGTAATG TTCAAGTAGA GATGAGCATT GAGGGGACCA TCAAAACGGA AGGGCAAGCT
GATGAGTCGA TCCAGATAGT GATGCGGGAG ATCCGGCTGC AAAACGGCAA TCGGAACCTT
GTAATCGAAT CGACAACACC TGACCAAGGA ATCGAGCGCA TCAACTACTT TCTCATCGAC
GGTGAAAGCT TTCAATACGC TGAACGCGAC AACGATCGTA CCTGTATTAG CGTGAGCGGC
AGCGATTTCT TTACCGGAAG TATAATCACA CCAGAGTCCT TGATCGGTGA TCTCAAAGAA
GCAACTCTCG CCGAACGTGG TGTACAAGTC AACGGTTTTA CCACCGATCG TTACACCTTC
AGCTTAAACG AGCAGAATCT TGGTTATCAA GGGCAAGCAA ACGGCGAAAT CTGGGTAGCA
AGCAACCCGA ACATCGTGGT ACGACATATT GGCACCCTGA ACGGCTCATT TGGTGGGATA
GCTGTTGAAG AAGGGGGCGA GATACTTCCG CAATCGACCG GGAATCTGAG CTGGAAGTAC
AACGTCACTC AGATCGCGGC AAACACCACC ATCACCCTCC CTGAGGTATG TGCGCAACAG
CAAACAGCCG GTGCCGACAT TCCACTCCCG CCAAACATCA GCAATACGTT GCGTACCAGC
AATTTGATCA GCTTCGAGAC AAGCGACACA GCAGCAAACA TCGCTCAGTT TTATCAGACC
GAGATGGTGG CAAAGGGATG GCAAGCGAGC GAAACCAATC AGTACGGTGA CACATACCAA
CTAACATTTA CCAAGGACGG TCGTACTGCA ACTGTCAACA TCTCGGCAGG TGATAAACAA
ACGATGGTGA TCATCCTTCT TGACTCGTAG
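
The GC content reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full sequence length; the record does not say how its value was computed or rounded. The helper name `gc_content` is hypothetical. Whitespace is removed first because the sequence is printed in space-separated blocks of ten bases.

```python
def gc_content(sequence_text):
    """Return the percentage of G and C bases in a DNA sequence.

    Spaces and line breaks (as used in the block layout above) are ignored.
    """
    seq = "".join(sequence_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Paste the full gene sequence between the quotes to check the record's value;
# only the first line of the sequence is used here as an illustration.
first_line = "ATGAACAAAA GTAAGACGGT ACCAACCCTC ATCAGCTTCG CGCTTGTTGT TGTGCTCACC"
print(round(gc_content(first_line), 1))
```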
 
Protein sequence
MNKSKTVPTL ISFALVVVLT ACGGTASAPA PTPTTAESAN PTRTPRPTAA NAQPTAAPTT 
AAEGSIIRPP PTNNQSGLTT LGNVQVEMSI EGTIKTEGQA DESIQIVMRE IRLQNGNRNL
VIESTTPDQG IERINYFLID GESFQYAERD NDRTCISVSG SDFFTGSIIT PESLIGDLKE
ATLAERGVQV NGFTTDRYTF SLNEQNLGYQ GQANGEIWVA SNPNIVVRHI GTLNGSFGGI
AVEEGGEILP QSTGNLSWKY NVTQIAANTT ITLPEVCAQQ QTAGADIPLP PNISNTLRTS
NLISFETSDT AANIAQFYQT EMVAKGWQAS ETNQYGDTYQ LTFTKDGRTA TVNISAGDKQ
TMVIILLDS
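
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below uses only the Python standard library and builds the codon table from the compact amino-acid string for the standard genetic code. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts, so the same table is assumed to apply here. This gene begins with ATG, so alternative start codons are not handled. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna_text):
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace is ignored. A codon containing anything other than A, C, G, T
    raises KeyError.
    """
    dna = "".join(dna_text.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 30 bases of the gene sequence above.
print(translate("ATGAACAAAA GTAAGACGGT ACCAACCCTC"))  # MNKSKTVPTL
print(translate("ACGATGGTGA TCATCCTTCT TGACTCGTAG"))  # TMVIILLDS
```

Passing the full gene sequence to `translate` should give the protein sequence with its spaces removed, which is a quick way to confirm that the two listings agree.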