Gene Cagg_0889 details

Gene Information

Locus tag: Cagg_0889
Symbol:
ID: 7268342
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1115691
End bp: 1117010
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 58%
IMG OID: 643565737
Product: ABC transporter, conserved site
Protein accession: YP_002462244
Protein GI: 219847811
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1116] ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component
TIGRFAM ID:
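
The start, end, gene length and protein length listed above are mutually consistent, which can be checked with a little arithmetic. A minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1115691, 1117010)
print(length_bp)                  # 1320
print(protein_length(length_bp))  # 439
```

Both values match the Gene Length and Protein Length fields of this record.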


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.807362
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00457345
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAGCAGG CACAGCCAAA ACCGCTGGTG GAATTGATGG GGGTATCGCA ACGCTACGGG 
CGGGGCGAGC GCCAGTTTAC CGCCATTCAA GATATTAACC TGACGATTGC TGAAGGTGAA
TTTGTCGCTT TGCTCGGCCC GTCGGGTTGT GGCAAGAGCA CGTTATTGCG CATTATCACC
GGTCTTAATC GGCCTAGCGA GGGATTGGTG CGCTATCGGG GTAAGATGCT TACCGGTGTG
AACCCGTTTG CGACCATCGT GTTTCAGACG TTTGCCCTCT TCCCGTGGCT TACGGTTGAA
GAGAATGTGG CGGTGGCGCT TATCGCACGG GGTGTAGATG AAGCTGAAGC GCACCGACGG
GCCGTTGAGT TGATCGATCT GGTCGGTCTT GATGGCTTTG AGCAAGCGTA TCCACGCGAG
TTGTCGGGTG GTATGCGCCA GAAAGTGGGG ATCGCCCGTG CGCTGGCAGT TGATCCCGAA
CTGCTCTGTC TCGATGAGCC GTTCAGTGCG CTCGATGTCT TGAGCGCTGA GACATTGCGG
GGTGAGGTGC TGGAGCTGTG GACGGGTGGG AATCTGAAGA TTCGGGCGGT GTTGATGGTC
AGCCACAATA TCGAAGAGGC GGTCTTTATG GCCGACCGGA TCGTGGTGAT GGATAAGAAC
CCGGGTCGGA TCATCGCCGA AGTGCCGGTT CAGCTACCGC ATCCGCGTGA TCGTAAATCT
GATGCTTTTG CCGCTTTGGT GGCGCGGGTG TATGCTGTGT TGGCCGGACA GACGCAACCG
GAAGCGATTG AGTATGGGAC GGAGCCGGGG CAAAACGGTC AGACGCGGTT GTTGCCACAG
GCTTCGGTTA CGGCGCTGGC CGGTCTGCTT GAGCAGGCGA ACGCGACCGA CCTTGAGCGT
GATCCGATTG CGCAGTTGCA AGATGAGCTT GGGTTGGATC TTGACCAGTT ATTACCGTTG
ATCGAGGCGG CTGAGTTGCT CGGCTTTGCA CGGGTGGAGA GCGGGAACTT GATTCTCACG
CCGCTCGGTG AGGCGTTTGC CGAGGCGAGT ATCCAAGCGC GTAAAGAGAT TTTTGCTTCG
CGGTTGCGGC GGTTGCCGTT CTTCCGCTGG ATGTTGCGCA TGATCATCGC CGCCGATAAT
CAATCGTTGC GTTGGGAAGT GCTGCGGACT GCCCTTGAGC CGGAGTTTCC GGCAGAAGTG
GCCGAGCGAC AGCTCGATGT TGCGTTGGAG TGGGGGCGTT ATGCCGAACT TTTTGCATAT
GACGACGCCG AGGGGAGATT TTTCCTTGAG ACGCCGACTA CGGTGGGTGG TGAACGCTAA
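
The sequence above is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field of this record. The helper names are hypothetical, and only the first line of the sequence is used, as a short illustration:

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short illustration.
first_line = "ATGCAGCAGG CACAGCCAAA ACCGCTGGTG GAATTGATGG GGGTATCGCA ACGCTACGGG"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_percent(seq), 1))   # GC percentage of this 60-base fragment
```

Passing the full 1320 bp sequence to `clean_sequence` and `gc_percent` in the same way gives a value that can be compared with the GC content field.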
 
Protein sequence
MQQAQPKPLV ELMGVSQRYG RGERQFTAIQ DINLTIAEGE FVALLGPSGC GKSTLLRIIT 
GLNRPSEGLV RYRGKMLTGV NPFATIVFQT FALFPWLTVE ENVAVALIAR GVDEAEAHRR
AVELIDLVGL DGFEQAYPRE LSGGMRQKVG IARALAVDPE LLCLDEPFSA LDVLSAETLR
GEVLELWTGG NLKIRAVLMV SHNIEEAVFM ADRIVVMDKN PGRIIAEVPV QLPHPRDRKS
DAFAALVARV YAVLAGQTQP EAIEYGTEPG QNGQTRLLPQ ASVTALAGLL EQANATDLER
DPIAQLQDEL GLDLDQLLPL IEAAELLGFA RVESGNLILT PLGEAFAEAS IQARKEIFAS
RLRRLPFFRW MLRMIIAADN QSLRWEVLRT ALEPEFPAEV AERQLDVALE WGRYAELFAY
DDAEGRFFLE TPTTVGGER
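
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a hedged illustration, not the database's own pipeline: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits. Because this gene starts with ATG, no special handling of alternative start codons is included; `translate` is an illustrative name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCAGCAGG CACAGCCAAA ACCGCTGGTG"))  # MQQAQPKPLV
```

The output matches the first ten residues of the protein sequence above. Translating the final 12 bases of the gene, `GGTGAACGCTAA`, gives `GER` followed by the stop codon, which agrees with the end of the protein.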