Gene Acid345_0825 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0825 
Symbol 
ID4072351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1025268 
End bp1026275 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content59% 
IMG OID637982834 
ProductABC transporter, substrate-binding protein, aliphatic sulphonates 
Protein accessionYP_589904 
Protein GI94967856 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.262946 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.791109 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAAGCGC GGAGTGTTAC TGCCCTATTG TTTTTCCTTC TGGCGGCCGC CCTGTTGCCC 
GCACAGGTGG CGATTCGCGT GGGCTATTTC CCAAACGTTA CGCATGCGGA GGCACTCGTC
GGTCGCGCGA ACGGCCGCTT TGAACAAGCC CTGGGGAGTG CCACAAAACT CGAGTGGAAG
ACCTTCAACG CCGGGCCGTC GGGGATCGAG GCGCTTTTCG CAGGCGCCGT CGACTTGCTT
TATGTCGGTC CTAATCCAGC GATCACCGGA TACATCCGAT CTCAAGGCGA GGCGTTGCGT
GTGGTTGCCG GAGGCGCCAG TGGAGGCGCT TCTCTGGTGG TGCGTAAGGG CGCGAACATT
CGGAACGTTG AAGACTTTCG CGGCAAGAAG GTAGCGTCCC CACAACTGGG GAATACACAG
GATGTCGCTC TGCGGGCCTG GCTGTTGCAG AACCACCTTA AGAGCACCGA CAAAGGTGGC
GACGTGCAGA TTGTCCCGCT CGCCAATCCC GATCAGCTCA CACTCTTTCA AAAGGGACAG
CTCGATGCGT CATGGGCGCC GGAGCCCTGG GCTGCGCGGC TGATTCAGGA AGCCGATGGC
CAGATATTTC TTGATGAACG CTCCCTCTGG CCAGACCACC GGTTCGCCGT CACGGAGGTC
GTGGTACGGA CGGCATTTCT GCGCGAGCAT CCGGACCTGG TGAAGAAGTG GTTGAGCGTC
CACGTGGAAC TGGCGAACTG GATCAACAAG AACCCTGGCG AAGCGAAGGC GATTGTTAAT
CGGCAAATTC AGAGCGACAC CGGTAGAGCA CTGCCTTCGC GAGTGTTGGA CGAGGCGTTC
AGCCGGCTCG AAATCACCTA TGACCCAATC CGCTCGTCGT TGACCGTTGT TGCGGAACGC
GCCTATCAAG CGGGTTTCCT CAAGCAAAGG CCAGACCTGA GTCGCCTTTA CAGCTTTGAG
CTGCTCAACC AGGTGTTACG CGAAAAGAAC TTGCCGAACG TTGAATAG
 
Protein sequence
MKARSVTALL FFLLAAALLP AQVAIRVGYF PNVTHAEALV GRANGRFEQA LGSATKLEWK 
TFNAGPSGIE ALFAGAVDLL YVGPNPAITG YIRSQGEALR VVAGGASGGA SLVVRKGANI
RNVEDFRGKK VASPQLGNTQ DVALRAWLLQ NHLKSTDKGG DVQIVPLANP DQLTLFQKGQ
LDASWAPEPW AARLIQEADG QIFLDERSLW PDHRFAVTEV VVRTAFLREH PDLVKKWLSV
HVELANWINK NPGEAKAIVN RQIQSDTGRA LPSRVLDEAF SRLEITYDPI RSSLTVVAER
AYQAGFLKQR PDLSRLYSFE LLNQVLREKN LPNVE