Gene Acid345_4006 details

Gene Information

Locus tag: Acid345_4006
Symbol:
ID: 4071142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4732716
End bp: 4733699
Gene Length: 984 bp
Protein Length: 327 aa
Translation table: 11
GC content: 56%
IMG OID: 637986033
Product: ABC sugar transporter, periplasmic ligand binding protein
Protein accession: YP_593080
Protein GI: 94971032
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
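
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4732716, 4733699)
print(length)                     # 984
print(protein_length_aa(length))  # 327
```

Both results agree with the Gene Length and Protein Length values listed in the record.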


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.82922
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCGCAC CCAAGAAAAT CGCTCTGACA CTTATTGCCG CAGCCTCCTG TCTCCTTGCC 
GGCTGTGCCA AGCATGACAA CGACGAGAAG TACATCCTGG TAACGGTCAA TTCGAAGGTT
GAGTATTGGA AGACCGCGCA GGCGGGCCTG ACCAAAGCCG CTGCGCAATA CGGCGTGAAA
TGGGACGTCC GCGGTCCTGA AAACTATGAT CCCCAAGCGG AGGTGCAGGA GTTCCGCAAT
GCTGCGGCAC AGAAACCTTC CGGCATCCTG GTGTCCGTCG CCGATGCTTC GCTGATGCAA
CCGGCAATTG ACGAGGCTAT TAACGCAGGC ATTCCCGTCC TCACCATCGA TTCCGATGCC
CCGAAGAGCA AGCGCCTTTA CTTCATCGGT ACCAACAACC GCCAAGCCGG TACGCTCGGC
GCAAAACGCC TGGTCGAGAA GCTTCACGGG AAGGGCAATG TCGTCTTCTT CACCATGCCC
CAACCGAACC TCGACGAACG GTTAGCGGGC TATAAAGACG TTCTCTCTGA CAATCCCGGT
ATCAAGATCG TGGAGGTCGT GAACATCAAA GGTGATTCCG GCAATGCCTT TGACCGCACC
GCGCATTATG CTGGCGCCAA AGATGCTCAG AAAATCGACG CCTTCGTCTG CCTGGAGGCG
ACGTCGGCGA AGGATGTCGC GCTTGCGCTG AAACGCGAAA ACGTAACCGA CCGGCTGGTA
ATTGCAATGG ACGTTGATCC CGCTACACTC GACCTCATTA AGTCAGGCGT GGTGGATGCG
ACCATTGCGC AGAAGCCCTA CACCATGGCG TTTTATGGAC TGAAGGCCCT CGATGAGATA
CATCACGGAA AGCCAGATCT CACCAAGGAC TACTCGTTCG ACTCGTTCTC GCCATTCCCA
GCGTTTGTCG ATACCGGCAC CTCAGTTGTT GACAAGACAA ACGTGGATCT CTATCTGCAA
GCGCGAGCTG CGAACGCAAA ATAA
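
The record reports a GC content of 56% for this gene. As a rough sketch of how such a figure can be computed from the sequence as displayed (blocks of ten bases separated by spaces), the hypothetical helper below strips whitespace and returns the fraction of G and C bases. How the database itself computes or rounds the value is not stated in the record.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Illustration on the first ten-base block of the gene only,
# not on the whole gene: 7 of its 10 bases are G or C.
print(gc_content("GTGTCCGCAC"))  # 0.7
```

Passing the full gene sequence as one string, with spaces and line breaks left in, gives the gene-wide fraction; multiply by 100 for a percentage.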
 
Protein sequence
MSAPKKIALT LIAAASCLLA GCAKHDNDEK YILVTVNSKV EYWKTAQAGL TKAAAQYGVK 
WDVRGPENYD PQAEVQEFRN AAAQKPSGIL VSVADASLMQ PAIDEAINAG IPVLTIDSDA
PKSKRLYFIG TNNRQAGTLG AKRLVEKLHG KGNVVFFTMP QPNLDERLAG YKDVLSDNPG
IKIVEVVNIK GDSGNAFDRT AHYAGAKDAQ KIDAFVCLEA TSAKDVALAL KRENVTDRLV
IAMDVDPATL DLIKSGVVDA TIAQKPYTMA FYGLKALDEI HHGKPDLTKD YSFDSFSPFP
AFVDTGTSVV DKTNVDLYLQ ARAANAK
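
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below builds the codon table in plain Python rather than using a bioinformatics library. It assumes the input is a complete coding sequence read in frame from its first base, and it treats the first codon as methionine when it is one of the table 11 initiation codons, which is why the leading GTG appears as M. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids.
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons of table 11; as the first codon they encode Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate_cds("GTGTCCGCAC CCAAGAAAAT CGCTCTGACA"))  # MSAPKKIALT
```

The output matches the first ten residues of the protein sequence listed above.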