Gene Cagg_2431 details


Gene Information

Locus tag: Cagg_2431
Symbol:
ID: 7266154
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2950408
End bp: 2952210
Gene Length: 1803 bp
Protein Length: 600 aa
Translation table: 11
GC content: 62%
IMG OID: 643567257
Product: hypothetical protein
Protein accession: YP_002463740
Protein GI: 219849307
COG category: [S] Function unknown
COG ID: [COG5373] Predicted membrane protein
TIGRFAM ID:
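
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length_bp(2950408, 2952210)
print(length, protein_length_aa(length))  # prints "1803 600"
```

Under these assumptions the computed values agree with the listed gene length (1803 bp) and protein length (600 aa).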


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.482444
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACAAC GTATTCCGGT CGATCTGCCG TTCACGATTT TCCTGCCCGC AGTTGACGGT 
GATCTGACGA TCCACGGTTG GGATGAACCG AGTGTTGAAT GGCAGTGTGA TGACCAAGTG
AACCTCACCA GGAATGAAGC CGGTCTGACG CTTGAGCCGT GTCACGGCGA TCTCCGGCTG
ACAGTGCCGG CGATGGCTGA AGTGCGGATC ACCGGTTGCA ATGGCGATAC ACGGGTGATG
CAGGTGCGTC GGTTGCTCAT CGAACATCAG CACGGCGACC TGGTCATCCG CCACATCACC
GAACAAGCGA CCATCGGCCA GCTCTCCGGT GATCTCCACG CGACGGAAGT GGCTGAATTG
AACGTGACCG CTCAGCTTTC CGGTGATGTC ACCTTACTCG CGACGCCGGT TGCGCGCCTA
CACACGGTTG CCGGCGATCT CGCTACGCGC GGTGTGCTGA GCCTATCATT GACCCAACTC
GATGGCGATC TGGTCGCTAC CGACCTCCGC GAGCAGTTGA GTGTTGCCGT GGTGAACGGC
GATGTGGAAG TTACCGGCAA CAATGCTCTC CTCCGCCTCC AGCAGGTGAA CGGCGATCTG
ACCATCCATG GGCAGGTGTC GGTGCTGGAA TGTGTTGCAG TGAGCGGTGA TGTTGACGCC
GAAGAGGCTA CTATCGGCCA ACTAGCTATC GAGACGGTGG CCGGCGATGT TGAGGTGGGT
GTGCTTACCG GTGGGCGGAT CGGAACGGTC GGTGGTGATC TTGAATTGAT GCAGGTTACC
GGCGAACTGA TGATCGGTAA TGTGGGTGGC GATTGTACCA TCAAACACGC CGGCGGCAAT
CTGACCCTCA ACGCGATTGG GAGCGATCTG TCGTTACGCG CCGAGGTCGT TGCCGGCAGC
ACGATTCGCG CTCAAGTAGG TGGTGATGCA GTGATTGTGT TGCCGAAAGA TCCCGATCTG
GTACTGACGG CAACTGCCGG TGGTGAAATC CGTGGTGTCG GTGTGAACCG TTCAGCGCCC
GGTCAGACAG TAGAGCTGCG CTACGGTAAC GGTGCCGCCA GTCTCCATCT GCTCGTCGGT
GGTGACGTGA TCGTCAAAGG TGCAACCCAA CCGGATACGT TCAACGGATT GGCCACGCAA
CTCGGTCACG AACTGAGCAA GCTGGGTCGC GAGTTGGGGC GTGAGTTAAG CGAATTGGGT
CGTGAATTGG CCGCTGAGTT GCGTAACACA CTGGCAAGTG GTGACCCCGC CGCCGCCGAC
CGCGCACGCG CTGCTGCCGA CCGTTTTGCC GCGCAGGCCC GTCGCCTCAA GGAAGAGGCC
GGCCCCGAAC GAATGCGCAT CCGTATCAAC GAACGGGAAT GGCGGCTTGA TCCCGAACGG
ATCGAGCGAA TCAAAGCGCA GGCTCGGCAG GCTGCCGCTG CCGGTCTGAA CGATGCACTT
GAGGCTGTCG AACGGGCGTT GAGCCGCCTC CAACCGCCGC CCCACGCGCC GGCACCACCG
CCACCACCAT CGCATCACGC ACCACCACCA CCCCACGCGC CGGCTCCACC ACCGCCACCC
CACGCGCCGG CTCCACCACC ACCACCCCAC GCGCCGGCTC CACCACCGCC ACCCCACGCG
CCGGCTCCAC CACCACCACC CCACGCGCCG GCGACCGGTC AGACGATCCA ATTACGCCCA
TCGCCCACAC CACCGAGCGA GGAAGATCGC GAACGGCAAC GTGCTGCGAT TTTGCAGATG
GTTGCCGATG GCCGAATCTC GGCAGCCGAA GGTGATCTGC TTCTCACCGC CCTCGACGAT
TAA
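
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be stripped before any computation. The sketch below shows one way to do that and to compute a GC fraction; the helper names are hypothetical. Only the first row of the sequence is used here, so the result will not necessarily match the 62% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a whitespace-formatted sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence above; paste all rows to check the full gene.
first_row = "ATGAAACAAC GTATTCCGGT CGATCTGCCG TTCACGATTT TCCTGCCCGC AGTTGACGGT"
print(f"{gc_fraction(first_row):.1%}")
```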
 
Protein sequence
MKQRIPVDLP FTIFLPAVDG DLTIHGWDEP SVEWQCDDQV NLTRNEAGLT LEPCHGDLRL 
TVPAMAEVRI TGCNGDTRVM QVRRLLIEHQ HGDLVIRHIT EQATIGQLSG DLHATEVAEL
NVTAQLSGDV TLLATPVARL HTVAGDLATR GVLSLSLTQL DGDLVATDLR EQLSVAVVNG
DVEVTGNNAL LRLQQVNGDL TIHGQVSVLE CVAVSGDVDA EEATIGQLAI ETVAGDVEVG
VLTGGRIGTV GGDLELMQVT GELMIGNVGG DCTIKHAGGN LTLNAIGSDL SLRAEVVAGS
TIRAQVGGDA VIVLPKDPDL VLTATAGGEI RGVGVNRSAP GQTVELRYGN GAASLHLLVG
GDVIVKGATQ PDTFNGLATQ LGHELSKLGR ELGRELSELG RELAAELRNT LASGDPAAAD
RARAAADRFA AQARRLKEEA GPERMRIRIN EREWRLDPER IERIKAQARQ AAAAGLNDAL
EAVERALSRL QPPPHAPAPP PPPSHHAPPP PHAPAPPPPP HAPAPPPPPH APAPPPPPHA
PAPPPPPHAP ATGQTIQLRP SPTPPSEEDR ERQRAAILQM VADGRISAAE GDLLLTALDD
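
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch that builds the codon table from the NCBI base ordering (T, C, A, G); the amino-acid assignments of translation table 11 are the same as those of the standard code, and the table-specific alternative start codons are ignored here because this gene begins with ATG. The `translate` helper is illustrative, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a whitespace-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGAAACAAC GTATTCCGGT CGATCTGCCG"))  # prints "MKQRIPVDLP"
```

The output matches the first block of the protein sequence listed above; passing the whole gene sequence to `translate` allows the same comparison over the full protein.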