Gene Clim_1648 details

Gene Information

Locus tag: Clim_1648
Symbol:
ID: 6353955
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1786023
End bp: 1787729
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 52%
IMG OID: 642669254
Product: sulfate transporter
Protein accession: YP_001943670
Protein GI: 189347141
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID: [TIGR00815] high affinity sulphate transporter 1
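
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table of this record.
start_bp = 1786023
end_bp = 1787729
gene_length_bp = 1707
protein_length_aa = 568


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1707
print(expected_protein_length(gene_length_bp))  # 568
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length fields of the record.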


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.423895
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCAAAC CCAAGTTCTT CAGCGTTCTT TCGGAATTAA CCCCCCAGCA GATTGCCCGC 
GATGTCACAT CCGGCATACT TGTCGGCATT GTTGCCCTTC CTCTTGCGAT AGCCTTTGCC
ATCGCATCCG GAGTATCACC GGAAAAAGGT CTCATCACCG CTGTCATTGG CGGTTTTATC
GTATCATTTC TTGGAGGCAG CCGTGTACAG ATAGGGGGCC CTACCGGAGC ATTTATCGTT
ATCCTCTACG GCATCGTAGA GCAGTACGGC GTCAACGGCC TGATGATTGC AACCATAATG
GCCGGCGTTA TTCTCATGCT CATGGGGTTC GCCCAGTTCG GTTCGCTCAT CAAGTTCATC
CCGTATCCGG TTGTAGTAGG CTTTACAAGC GGCATCGCAG TCATAATCTT TTCAAGCCAG
ATCAGCGATT TTCTGGGCCT CGGCATCGAC AAGGTTCCAG CCGACTTTAT CGATAAATGG
ATCGCTTACG GTCGACATCT GTCGGCCATA AACCCGGAAA GTTTTCTTGT CGGCCTGCTC
TCGCTCCTCA TTATTGTGTT CTGGCCCAGG GTTTCCAAAA AAATACCCGG ATCGATCATC
GCTATCATTG CCGCAACTGT TCTGGCACAC TATCTCAAGC TCGACGTTGC CACAATCGAA
TCCCGATTCG GGGAAATTCC CTCTGCCCTT CCTGCTCCTG CTTTTCCGGT TTTTGATTTT
GCTACGGTAA AACAGCTCAT CATGCCGGCC ACGACGATTG CACTGCTCGG AGCAATCGAA
GCTCTGCTCT CCGCCGTGGT ATCCGACGGC ATGATCGGCA GTCGGCATAA ATCGAACATG
GAACTGGTCG CCCAGGGCGC TGCAAACATC ATCTCTCCGC TTTTCGGAGG CATTCCGGTC
ACCGGAGCGA TCGCACGCAC GGCAACCAAC GTCAAAAACG GTGGACGCAC ACCCGTTGCC
GGCATGGTTC ATGCCCTCAC GCTGCTGCTG ATCATGATGC TGTTCGGTAA ATGGGCAAAG
CTCATACCGA TGCCGACACT TGCCGCCATC CTGATTGTGG TTGCCTGGAA CATGAGCGAA
CACAATGTGT TCCGCAAGCT CCTCAAAAGC CCGAAAAGCG ACGTCGTCGT TCTCCTGACT
ACATTTGGGC TGACCGTTGT TTTCGACCTG ACGATAGCCA TTGAAATCGG CATGCTGCTT
TCGGTTTTCC TCTTTATGCA GCACATGGCA AGCCTTGCCA ACGTCAACAT CATTACAAGG
GAGCTGAAAG ACCGCGAGGA GGAGGACGAC CCGAACACCA TCAGTACAAG AAGCGTACCG
GATGGCGTCG AGGTTTTTGA AATCAGCGGA GCAATGTTTT TCGGTGCCGC TTCGAAGTTC
AAGGATGCCA TGCATATCGT AGAAAAAGCA CCAAAAGTCC GCATCATCAG AATGAGGAAT
GTGCTCTCTA TCGATGCAAC CGGCCTCAAC ATGCTCCAGG AACTGCTTGC CGATTCAAGA
AAAACCTCAA CGCATCTTAT TCTTTCGGGC GTCCACGCCC AGCCGCTTTT TGCCATGCAG
CAATACGGCA TTTATGATGA CATCGGTGAA ACGAACATTT TCGGCCACAT CGATGATGCT
CTCGACCGGT CAAGAGAGCT GCTCGGTCTG CCAAAAATGG GAAAACAGAA GGATTTCGTT
CCATCGGTGC AGCGCGATAA CCAATGA
 
Protein sequence
MFKPKFFSVL SELTPQQIAR DVTSGILVGI VALPLAIAFA IASGVSPEKG LITAVIGGFI 
VSFLGGSRVQ IGGPTGAFIV ILYGIVEQYG VNGLMIATIM AGVILMLMGF AQFGSLIKFI
PYPVVVGFTS GIAVIIFSSQ ISDFLGLGID KVPADFIDKW IAYGRHLSAI NPESFLVGLL
SLLIIVFWPR VSKKIPGSII AIIAATVLAH YLKLDVATIE SRFGEIPSAL PAPAFPVFDF
ATVKQLIMPA TTIALLGAIE ALLSAVVSDG MIGSRHKSNM ELVAQGAANI ISPLFGGIPV
TGAIARTATN VKNGGRTPVA GMVHALTLLL IMMLFGKWAK LIPMPTLAAI LIVVAWNMSE
HNVFRKLLKS PKSDVVVLLT TFGLTVVFDL TIAIEIGMLL SVFLFMQHMA SLANVNIITR
ELKDREEEDD PNTISTRSVP DGVEVFEISG AMFFGAASKF KDAMHIVEKA PKVRIIRMRN
VLSIDATGLN MLQELLADSR KTSTHLILSG VHAQPLFAMQ QYGIYDDIGE TNIFGHIDDA
LDRSRELLGL PKMGKQKDFV PSVQRDNQ
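
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is a minimal example, not the method used to produce this record: it builds the codon table in plain Python instead of calling a bioinformatics library, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Codons containing unexpected characters are rendered as 'X'.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGTTCAAAC CCAAGTTCTT CAGCGTTCTT"))  # MFKPKFFSVL
# Last 12 bases of the gene sequence above, ending in the TGA stop codon.
print(translate("GATAACCAATGA"))                       # DNQ
```

The two fragments reproduce the first ten and last three residues of the protein sequence listed in this record. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.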