Gene Cagg_2116 details

Gene Information

Locus tag: Cagg_2116
Symbol:
ID: 7267623
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2604022
End bp: 2606133
Gene Length: 2112 bp
Protein Length: 703 aa
Translation table: 11
GC content: 59%
IMG OID: 643566949
Product: sulphate transporter
Protein accession: YP_002463438
Protein GI: 219849005
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0607] Rhodanese-related sulfurtransferase
        [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID: [TIGR00377] anti-anti-sigma factor
            [TIGR00815] high affinity sulphate transporter 1
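
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(2604022, 2606133)
print(length)                     # 2112
print(protein_length_aa(length))  # 703
```

Under these assumptions the computed values match the listed Gene Length (2112 bp) and Protein Length (703 aa).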


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.242384
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTTCCT TTGGACACTC GGTCGCTCAG CTCTTCACCC GACCGGTCAG ACTGGTTCGC 
AGCTTCACCC CCGAAACCCT GCGTGCCGAC TTTCTTGCCG GATTGACCGT CGGATTGGTC
CTGTTACCGC AATCGCTTGC CTTCGCGCTA TTAGGTGGTT TACCACCCAT TACCGGCTTG
TATACTGCAT TGACCGCTAC TATTGTCGGT GCATTGTGGG GGTCGTCAAG TCATCTCAAT
AGTGGACCAA CAAACACAGC GGCCATTATC ACTTTATCGG TTCTAGCTCC GGTTGTCCGC
ATCGACAGCC CGGAGTTTGT CACGGCAGCG AGTTTGGTTG CGGTAATGGC CGGTATTATC
CGGGTTATCA TGGGTATTGC TCGCCTTGGC ATCTTGGTCA ATTTTGTCTC TGATGCGGTA
TCGGTGGGAT TTACCGCCGG CGCTGGTATT TTGATACTTT CAAATCAAAT CGGCCCATTG
TTACGCATCG ATCTCCCACC CGGTGCCGAT CCCATTACGA CGGTTACCGA AACAGCTCGT
CACCTTGATG CCATTCACTG GCCTTCGTTA GCGGTAGGAG TGGCTACGAT TGGCATCATT
TTGTTTTCAC CACGCATCAC CCGCAAAATA CCGTCCGTCT TGATTAGTAT CGTGATCGTT
TCACCGATCG TCTGGTTCCT CAATCTCAAA GCGCAGGGTG TGCGAGTAAT GGGACCGGTA
CCACCGGGTT TTCCACCGCT GGCCCAGTTA CCGATCTTCG ATCTAGATCT GATCAGCCAT
CTGTTGAATG GCGCGTTGGC ACTAGCTATT ATCGGCTCGG TAGAGGCAGT TGCAATTGCA
CGAGCAATTG CCGGCTATAC CGGTGAGCGT ATTGACAGCA ACCAAGAGTT TGTCGGACAG
GGGTTGGCCA ACATCGCCTC TGGCATCTTT TCAGGAATGC CATGCTCAAG TTCGTTCAAT
CGCTCGGCAT TGGCCTATCA GTCGGGAGGC CAAACGGCTC TTACCGGGGT TGTATCCGGG
ATAACAGTTT TTCTGGCAAC CACCGTCCTC GGTCCGTTAC TGGCCGAAGT CCCGCGGGCG
GCATTGGCCG GCGCATTGGC GGTGACCGCT TGGAGTATGG TTGACCGCCG GAATATGGCG
CGGATTTGGC GCGGGTCACG CAGTGAGGCA GCTATTATGA TCATCACGCT CGTCCTCACC
CTGACCCTCC CCCTCCAATT CGCCATCCTC ACCGGTGTGC TGATGTCGCT CGGCTATTAC
CTCCTCCGTA CCGCTACCCC GCGCATTGAA GCCGTCGTCC CCGACACAGC GTTCCGCCAT
TGGGACCCCG CCCACGGCCG TCCCACCTGC CCTCAGCTCC TCGTCGTCGA CCTCCAAGGC
GACCTCTACT TCGGCGCCGC CAGCCACGTC GAAGAAGCCT TGCTCCGCCT CCTCGACCAG
CACCGCACTG TCCGCTACCT GCTCCTCCGA ATGCACAGCG TCAATCACTG CGATGTCAGC
GGCATTCGTG CCCTCGAAAC CATCCGCCGC ACCCTCCGCG TCCGCGGTGG CGACCTCTAC
TTCGTCCGTG TGCGTGCCGG CGTGATGTAT CGGATGCAGA TCAGCGGCTT TTATGAGCAA
CTCGGCCCCG AACGCTTCCT CGATGAAGAT ACCGCCATCG AGTTCCTCTT CCACCGCGTC
CTCGACCCCG CCGTCTGCAT CTACGAATGC GACCGCCGCG TCTTCCGCGA ATGCCAAGCA
TTGCCCAAAC AGACGCTGCC CGGCCACTTG ACCATCCCGC TGCTCGACGG CAAGCTGCCG
GCCCAGATCA ATGCCCGCGC CTTGTGGGAA GCATTGCATA GCCCGCAACC GCCGGCGGTC
ATCGACGTGC GTGAACCGCG CGAGTTTCAG CGCGGCCATA TTCCCGGTGC GCGCAACATC
CCGCTCTCGC GGCTGCTCAG CGAGCGTGAT ACCGTGCCGG CGGGGCCGGT CGTGCTGGTC
TGTCGTAGCG GAAGGCGCAG CTTACGGGCA GCAGCGTTGC TGGTTGAACG CACCCCCCCG
CCGCAGGTGC TGGAGGGCGG GATGCTCGCG TGGGAAGCCG CCAACCTGCT CGAAGCCGTT
GAGCAGGCGT GA
 
Protein sequence
MLSFGHSVAQ LFTRPVRLVR SFTPETLRAD FLAGLTVGLV LLPQSLAFAL LGGLPPITGL 
YTALTATIVG ALWGSSSHLN SGPTNTAAII TLSVLAPVVR IDSPEFVTAA SLVAVMAGII
RVIMGIARLG ILVNFVSDAV SVGFTAGAGI LILSNQIGPL LRIDLPPGAD PITTVTETAR
HLDAIHWPSL AVGVATIGII LFSPRITRKI PSVLISIVIV SPIVWFLNLK AQGVRVMGPV
PPGFPPLAQL PIFDLDLISH LLNGALALAI IGSVEAVAIA RAIAGYTGER IDSNQEFVGQ
GLANIASGIF SGMPCSSSFN RSALAYQSGG QTALTGVVSG ITVFLATTVL GPLLAEVPRA
ALAGALAVTA WSMVDRRNMA RIWRGSRSEA AIMIITLVLT LTLPLQFAIL TGVLMSLGYY
LLRTATPRIE AVVPDTAFRH WDPAHGRPTC PQLLVVDLQG DLYFGAASHV EEALLRLLDQ
HRTVRYLLLR MHSVNHCDVS GIRALETIRR TLRVRGGDLY FVRVRAGVMY RMQISGFYEQ
LGPERFLDED TAIEFLFHRV LDPAVCIYEC DRRVFRECQA LPKQTLPGHL TIPLLDGKLP
AQINARALWE ALHSPQPPAV IDVREPREFQ RGHIPGARNI PLSRLLSERD TVPAGPVVLV
CRSGRRSLRA AALLVERTPP PQVLEGGMLA WEAANLLEAV EQA
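
Both sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how such blocks could be joined back into a contiguous string and how a GC percentage could be computed from the result is shown below. The function names are hypothetical, and the example uses only the first printed line of the gene sequence.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces, newlines) between sequence blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First printed line of the gene sequence, copied from the record above.
first_line = "ATGCTTTCCT TTGGACACTC GGTCGCTCAG CTCTTCACCC GACCGGTCAG ACTGGTTCGC"
dna = join_blocks(first_line)
print(len(dna))         # 60
print(gc_percent(dna))  # GC percentage of this 60-base fragment only
```

Applied to the full gene sequence, the result can be compared against the GC content listed in the record (59%).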