Gene Acid345_0223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0223 
Symbol 
ID4071676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp234034 
End bp235695 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content60% 
IMG OID637982224 
Productsulphate transporter 
Protein accessionYP_589302 
Protein GI94967254 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00377] anti-anti-sigma factor
[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGAGT GGCTACCGAA GTCCGTTCTT GCGCTGCGTG ATTACTCCCG CCAGCGCTTT 
GTTGCCGACC TGCTTGCCGG CATTACTGTG GGGCTGGTTG CGCTTCCGCT GGCGATGGCG
TTTGCCATTG CTTCTGGCGT GCCGCCGCAG AGCGGACTGT ATTGCGCGAT CGTCGCCGGG
TTCCTCGTCT CGGCGTGCGG TGGTTCGCTG ACACAGATCG GTGGCCCGAC CGGCGCGTTC
GTCGTCGTGG TCTACAACAT TGTCGCCAAG CATGGGATCG ATGGGTTGTT CATGTGCACG
CTCGAAGCCG GAGTGATTCT TGTGCTGCTC GGCATCACCG GACTGGGGTC GGCGGTGAAG
TTCATTCCGC GGCCGGTGGT GGTTGGATTC ACCAACGGGA TCGCGGTGAT CATCGCGAGC
ACGCAGATTA AAGACTTCTT CGGACTGAAG ATTGAAAAAG TGCCGGGCGA TTTTCTCGAT
CGCATGGAAG TTCTCGGCAA AAACTTCCGA ACGCTTTCTG TTGAGGAAAC GTGTATTGGC
TTGCTGGCGC TGGCGATCAT CATTGCCTTC ATGCGCTATG TGAAGCGCGT GCCGGGCTAT
ATCGTGGCGC TGGTCGCGGG CACTGCCGCG GTGCTGCTTT TGCATCTCGA CGTGCAGACC
ATCGGAACGC GCTTTGGAGG AATCCCATCG GGCTTGCCGA AGCTGGAGAT CCCTCAGTTT
CACGCCAACC TGCTACGACC GCTGATCTCG CCCGCTCTGA CCGTGGCAAT GCTGGGCGCG
ATCGAGTCGT TGATGTCAGC CGTGGTTTCC GATCGGCTCA GCGGCGATAA GCACAACCCG
AATGTTGAAC TGGTCGGCCA GGGCATTGCC AATATCTTTT CGCCGCTCTT TGGCGGACTG
CCTGCCACGG GCGCGATTGC GCGTACTGCG ACGAACATCC GCTCCGGCGC GACCTCGCCC
GTCGCTGGAA TGATTCACTC GGCGACGTTG CTGGCGATTG TGGTCTTTGC GGCGCCGGCA
GCGAAGTTCA TTCCGCTCGC GGTGCTCTCG GCGATCCTGT TCGTGGTGGC CTACAACATG
GGCGAGTGGC GCGAGATCCC GCAGATCCTC AAGCTCTCGA AGCTGGAGAT AGGTACGTGG
CTTGCGTCCT TCTTGTTGAC CGTCTTCGCG GACCTGACCA CCGCCGTCGA AGCCGGAATG
ATCATGGCAG TGCTCGTCTT CATTCGCCGG GTGTCGCTCA CCACAACCGT GAGCATGGTG
ACCGAAGAGT ACGTGAAAGC CGGGCATGCA CACATCCTCC AGAACAAGGA CATTCCGGGC
TACGTCGCGA TCTTCCGCAT TCACGGCCCG TTCTTGTTCG GCACTACGGA GAAGATGGAA
GAGATCACCT CGCGTATTGA CGAGCTTCCG CCGGTCGTCA TCGTGCGCTT GCGCAACATG
ACCGCGATTG ACGCCACCGG TATTCAGGCT TTAGAGACCG CGATCGAAGC GATACGCAGG
ACGGGGCGCA AGGTGCTGCT GTGTGGCGCT CGTCGACAAC CCAAGGAACT GATCGAACAG
TCGGGGTTGG GTGCGCACGT GGGCGAGGAG AATATGCTCA ACAGCATTTC CGACGCGCTG
GAGCGCGCGA AGGCGCTCCA GACCAGCGTA GTACACGCCT AA
 
Protein sequence
MHEWLPKSVL ALRDYSRQRF VADLLAGITV GLVALPLAMA FAIASGVPPQ SGLYCAIVAG 
FLVSACGGSL TQIGGPTGAF VVVVYNIVAK HGIDGLFMCT LEAGVILVLL GITGLGSAVK
FIPRPVVVGF TNGIAVIIAS TQIKDFFGLK IEKVPGDFLD RMEVLGKNFR TLSVEETCIG
LLALAIIIAF MRYVKRVPGY IVALVAGTAA VLLLHLDVQT IGTRFGGIPS GLPKLEIPQF
HANLLRPLIS PALTVAMLGA IESLMSAVVS DRLSGDKHNP NVELVGQGIA NIFSPLFGGL
PATGAIARTA TNIRSGATSP VAGMIHSATL LAIVVFAAPA AKFIPLAVLS AILFVVAYNM
GEWREIPQIL KLSKLEIGTW LASFLLTVFA DLTTAVEAGM IMAVLVFIRR VSLTTTVSMV
TEEYVKAGHA HILQNKDIPG YVAIFRIHGP FLFGTTEKME EITSRIDELP PVVIVRLRNM
TAIDATGIQA LETAIEAIRR TGRKVLLCGA RRQPKELIEQ SGLGAHVGEE NMLNSISDAL
ERAKALQTSV VHA