Gene Cagg_2117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2117 
Symbol 
ID7267624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2606375 
End bp2608486 
Gene Length2112 bp 
Protein Length703 aa 
Translation table11 
GC content59% 
IMG OID643566950 
Productsulphate transporter 
Protein accessionYP_002463439 
Protein GI219849006 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0607] Rhodanese-related sulfurtransferase
[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00377] anti-anti-sigma factor
[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.271394 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0391706 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATT ACCTGCGTTC AGTCATAACT CTCTTTGCCC AACCGATCCG GCTTATCCGC 
ACGTACCCTA TTGATGCGTT GCGCGCTGAC TTCTTTGCCG GACTCACAGT TGGTCTCGTC
TTGTTGCCAC AATCGCTTGC CTTTGCAATT TTGGGTGGGT TACCGCCAAT TGTTGGGCTG
TACAGCGCCA TGACGGCAAC CATCATCGGT GCATTGTGGG GTTCATCAAG TCACGTGAAC
GGTGGTCCAA GCACGACCAG CGCGATCTTG ACCCTCTCTG TGCTCCTCCC GATTGCACCG
ATTGGTAGCG CCGAGTTTGT CACCGCAGCC AGTATGATCG CGGTGATAGC CGGGATCATT
CGCTTAATCA TGGGAATAGC GCGCCTTGGC ATGCTGGTAA CCTTTATTTC GGATGCCGTA
GCTGTGGGAT TTACTGCCGG CTCGGGTCTG TTGATTCTGT TCAACCAATT TGGCCCAATC
ACCAAACTCT CTATCTCACC TGGCGCCAGC ATTCCAACGA TCGTCCAAGA AACGGTCACC
CAACTAACGG AAATCCATTG GCCCTCACTC ATTATCGGTG CCTCTACGAT TGCTCTTATC
TATCTCCTCC CCTATATCAC ACGCGCTGTG CCGGCAGCCC TGCTCAGTAT GATAATTGTC
ACAATCGTGA CTACATTCCT CAACATCGAA CAATACGGAG TGCGCTTAAT CGGTGATATC
CCACCAGGTT TTCCACCACT GGCCCAATTG CCCTGGTTTG ATATTGGTTT GATCGGCCAG
TTGTTAAACG GCGCGCTTGC TTTGGCAATC ATTGGTTTAG TAGAGGCAGT CGCGATTGCA
CGGGCTATTG CCAGTTACAC CGGCCAGCGG ATTGATAGCA ACCAAGAGTT TGTTGGGCAG
GGATTGGCCA ACATCGTTTC TGGCCTCTTC TCAGGGATGC CGTGTTCTGG TTCGTTCAAC
CGTTCAGCAC TGGCTTATCA AGCCGGTGGT AAAACTGCAC TCACCGCCGT AATTTCTGGC
CTTACTGCAC TGGCAGGAAC GGCGATCTTG AGTCAAGTAT TAGCGATTAT TCCCCGACCG
GCTATTGCCG GAACACTGAT GGTCGCAGCA CTTGGTATGG TCGACCGGCG TGGTATGGCG
CGGGTTTGGC GCGGGTCACG GGGCGAGGCA GCTATTATGA TCATCACGCT CGTCCTCACC
CTGACCCTCC CCCTCCAATT CGCCATCCTC ACCGGTGTGC TGATGTCGCT CGGCTATTAC
CTCCTCCGTA CCGCTACCCC GCGCATTGAA GCCGTCGTCC CCGACACAGC GTTCCGCCAT
TGGGACCCCG CCCACGGCCG TCCCACCTGC CCTCAGCTCC TCGTCGTCGA CCTCCAAGGC
GACCTCTACT TCGGCGCCGC CAGCCACGTC GAAGAAGCCT TGCTCCGCCT CCTCGACCAG
CACCGCACTG TCCGCTACCT GCTCCTCCGA ATGCACAGCG TCAATCACTG CGATGTCAGC
GGCATTCGTG CCCTCGAAAC CATCCGCCGC ACCCTCCGCG TCCGCGGTGG CGACCTCTAC
TTCGTCCGTG TGCGTGCCGG CGTGATGTAT CGGATGCAGA TCAGCGGCTT TTATGAGCAA
CTCGGCCCCG AACGCTTCCT CGATGAAGAT ACCGCCATCG AGTTCCTCTT CCACCGCGTC
CTCGACCCCG CCGTCTGCAT CTACGAATGC GACCGCCGCG TCTTCCGCGA ATGCCAAGCA
TTGCCCAAAC AGACGCTGCC CGGCCACTTG ACCATCCCGC TGCTCGACGG CAAGCTGCCG
GCCCAGATCA ATGCCCGCGC CTTGTGGGAA GCGTTGCATA GCCCGCAACC GCCGGCGGTC
ATCGACGTGC GTGAACCGCG CGAGTTTCAG CGCGGCCATA TTCCCGGTGC GCGCAACATC
CCGCTCTCGC GGCTGCTCAG CGAGCGTGAT ACCGTGCCGG CGGGGCCGGT CGTGCTGGTC
TGTCGTAGCG GAAGGCGCAG CTTACGGGCA GCAGCGTTGC TGGTTGAACG CACCCCCCCG
CCGCAGGTGC TGGAGGGCGG GATGCTCGCG TGGGAAGCCG CCAACCTGCT CGAAGCCGTT
GAGCAGGCGT GA
 
Protein sequence
MSDYLRSVIT LFAQPIRLIR TYPIDALRAD FFAGLTVGLV LLPQSLAFAI LGGLPPIVGL 
YSAMTATIIG ALWGSSSHVN GGPSTTSAIL TLSVLLPIAP IGSAEFVTAA SMIAVIAGII
RLIMGIARLG MLVTFISDAV AVGFTAGSGL LILFNQFGPI TKLSISPGAS IPTIVQETVT
QLTEIHWPSL IIGASTIALI YLLPYITRAV PAALLSMIIV TIVTTFLNIE QYGVRLIGDI
PPGFPPLAQL PWFDIGLIGQ LLNGALALAI IGLVEAVAIA RAIASYTGQR IDSNQEFVGQ
GLANIVSGLF SGMPCSGSFN RSALAYQAGG KTALTAVISG LTALAGTAIL SQVLAIIPRP
AIAGTLMVAA LGMVDRRGMA RVWRGSRGEA AIMIITLVLT LTLPLQFAIL TGVLMSLGYY
LLRTATPRIE AVVPDTAFRH WDPAHGRPTC PQLLVVDLQG DLYFGAASHV EEALLRLLDQ
HRTVRYLLLR MHSVNHCDVS GIRALETIRR TLRVRGGDLY FVRVRAGVMY RMQISGFYEQ
LGPERFLDED TAIEFLFHRV LDPAVCIYEC DRRVFRECQA LPKQTLPGHL TIPLLDGKLP
AQINARALWE ALHSPQPPAV IDVREPREFQ RGHIPGARNI PLSRLLSERD TVPAGPVVLV
CRSGRRSLRA AALLVERTPP PQVLEGGMLA WEAANLLEAV EQA