Gene Acid345_4738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4738 
Symbol 
ID4070676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5601050 
End bp5602747 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content60% 
IMG OID637986782 
Productsulfate transporter 
Protein accessionYP_593811 
Protein GI94971763 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00377] anti-anti-sigma factor
[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.181577 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCCA ATTCGCAGTC TCAGAGCCTC TTCTCGTTGC CGAATTGGCT GACTTCCTAC 
CGCTCCGAGT GGCTGCGTCC CGACATCATT GCCGGGCTCA CCGCCGCTGC CGTCGTCATC
CCAAAGTCGA TGGCGTATGC GACTATCGCC GGGCTGCCAG TGCAGGTCGG CCTCTACACG
GCGTTCCTGC CGATGATCAT CTACGCCGTT CTCGGCACCT CGCGTGTACT CAGCGTTAGC
ACAACGACAA CAATCGCCAT CCTTACTGCC GCGGAATTTG CGGAGGTGGT TCCGAACGGT
GACGCCGCCT CTCTGCTTCG AGCTTCAGCG ACCTTGACGC TCTTAGTTGG AGCGATGCTC
GTGGTTGCAT GCTTCCTGCG ATTGGGGTTC GTGGCGAACT TCATTTCACA ACCAGTACTC
ACCGGTTTTA AGGCCGGAAT CGGCATCGTT ATCGTGCTTG ACCAGGTGCC AAAGCTTCTC
GGAGTGCATA TCCCTCGCGC AACGTTTTTG AAGAACGTGC TGGCGACGCT CCGGAGTATT
CCTGAAACCA AGCTGCTCAC CCTCGGGGTG AGCGTAACGG TGATTGTTCT GCTGGTCGCT
CTCGAGCATT TCATGCCGAA GTCGCCAGCG CCCCTGATCG GGGTTGCTGT TGGCATTGCG
GGCGCCTATT TCCTGCATTT GAGCACGCAC GGCGTTGAAC TGGTCGGGCG CATTCCGCAA
GGATTGCCGC CGGTGACGCT TCCCGCGCTC GGGATGGTAG AGCATCTCTG GCCGGGTGCG
CTCGGGATTG CGCTGATGAG CTTTACCGAG ACGATTGCGG CGGGCCGCGC CTTCGCGAAG
AGCGACGAGC CTTGGCCGCA GGCCAACCGT GAGTTGATGG CTACGGGTTT GGCCAACGTA
GGCGGCGCTC TGCTCGGCGC CATGCCGGGC GGTGGTGGAA CTACGCAGAC AGCGGTGAAT
CGCCTCGCCG GAGCTCGCAC CCAGGTGGCG GAACTCGTGA CCGGCGCCAT GACGCTGGTG
ACCATGTTGC TTCTGGCGCC GATGATTGCC CTCATGCCAC AGGCGACGCT CGCGGCCGTG
GTGATCGTGT ACTCGGTAGG CTTAATCAAG CCTGCAGAGT TCCGCGAGAT CCTCCGTGTG
CGCCGAACCG AGTTCCTCTG GGCCGTGATC GCGATGGCAG GAGTGGTCCT TGTTGGAACG
TTAAAAGGCA TCCTCGTGGC AATCATCGCT TCGCTGGTCG CGCTTGCGTA TCAGGTCGCA
AATCCATCGG TCTATGTTTT GGGACGCAAG CCTGGCACGA ACATTTTCCG GCCGCGTTCG
GCAGAGCATC CTGAAGACGA GACCTATCCG GGGCTGCTGA TGGTGCGGCC GGAGGGCCGA
ATCTTCTTTG CCAATGCGGA GAATTTGTCG CATAAGGTCT GGGTGCTGAT AGATGAGGCG
AAGCCGAACG TCGTGATCGT GGACATGCGT GCGGTCTTCG ACCTCGAATA TACCGCCCTC
AAAATGTTCA CCGAAGGCGA GAAGAAGCAG CGCGAATATG GCATGCGTCT GTGGCTGGTT
GGGATGAATC CGCATGTGTT CGATATGGTG CAGAAGTCTG CGCTCGGAGA ATCGCTCGGA
CGGGAGGGGA TGCACCTGAA CCTGGAGAGC GCGGTGGCGA AGTACGCGGA GCATGCATCG
CTCCCGGCGA CGGTATAG
 
Protein sequence
MSSNSQSQSL FSLPNWLTSY RSEWLRPDII AGLTAAAVVI PKSMAYATIA GLPVQVGLYT 
AFLPMIIYAV LGTSRVLSVS TTTTIAILTA AEFAEVVPNG DAASLLRASA TLTLLVGAML
VVACFLRLGF VANFISQPVL TGFKAGIGIV IVLDQVPKLL GVHIPRATFL KNVLATLRSI
PETKLLTLGV SVTVIVLLVA LEHFMPKSPA PLIGVAVGIA GAYFLHLSTH GVELVGRIPQ
GLPPVTLPAL GMVEHLWPGA LGIALMSFTE TIAAGRAFAK SDEPWPQANR ELMATGLANV
GGALLGAMPG GGGTTQTAVN RLAGARTQVA ELVTGAMTLV TMLLLAPMIA LMPQATLAAV
VIVYSVGLIK PAEFREILRV RRTEFLWAVI AMAGVVLVGT LKGILVAIIA SLVALAYQVA
NPSVYVLGRK PGTNIFRPRS AEHPEDETYP GLLMVRPEGR IFFANAENLS HKVWVLIDEA
KPNVVIVDMR AVFDLEYTAL KMFTEGEKKQ REYGMRLWLV GMNPHVFDMV QKSALGESLG
REGMHLNLES AVAKYAEHAS LPATV