Gene Sama_2209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_2209 
Symbol 
ID4604459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp2670054 
End bp2671073 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content53% 
IMG OID639781606 
ProductC4-dicarboxylate-binding periplasmic protein 
Protein accessionYP_928084 
Protein GI119775344 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.160504 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTAT CAAAACCAGC GACCCTGCAA TCTCTCTTTA CCCTAGGCAA AGCCAGCCTG 
CTGGCAACCG TGCTGGGATT CAGCTTCGGT GCAGTCGCCG AACCGGTAGA AATCAAGTTC
TCCCACGTGG TAGCGGAAAA CACCCCCAAA GGCCAAATGG CGCTCAAGTT TAAAGAGTTG
GTGGAAAGCC GTCTTCCCGG TGAATATAAG GTGAGTGTAT TTCCCAACTC ACAGCTCTTT
GGTGACAACA ACGAACTGGC GGCACTGCTG CTGAACGATG TACAGCTGGT AGCGCCATCC
CTGTCCAAGT TCGAGCGCTA TACCAAAAAA CTGCAGGTAT TCGATCTGCC CTTCCTGTTT
GAAGACATGG ATGCGGTGGA CCGCTTCCAA CAGAGTGAAG CTGGCCAGCA ACTGCTGAAC
TCTATGAGCC GCAAAGGCCT GGTTGGTTTG GGCTATCTGC ACAATGGGAT GAAGCAGTTT
TCGGCCAACA ATGCCCTGTC ACTGCCAGGC GACGCCGCCG GTAAGAAATT CCGCATCATG
CCTTCCGATG TGATTGCAGC GCAGTTTGAG GCCGTGGGTG CCATCCCGGT GAAAAAGCCG
TTCTCCGAAG TCTTTACCCT GCTGCAGACC CGCGCCATCG ATGGCCAGGA AAACACCTGG
TCCAATATCT ATTCCAAGAA GTTTTATGAA GTACAGACTC ACATTACCGA GAGCAATCAC
GGCGTACTCG ACTATATGTT GGTCACCTCT GAAACCTTCT GGAAGAGTCT GCCCAAGGAC
AAACGCGAAA TCATCAAGCA GTCCATGGAC GAAGCCGTTG CCCTTGGGAA CAAACTGGCT
CTGGAAAAAG CCAACGAAGA TCGTCAGCTC ATCCTCGACT CCAAGCGTGT TGAGCTGGTG
ACCCTGACCC CCGAGCAGCG CCAGGCCTGG GTTAATGCCA TGCGTCCTGT CTGGTCACAG
TTTGAAGACA AGATTGGTAA AGACCTGATT GAAGCCGCCG AGTCTGCCAA CAAGCCGTAA
 
Protein sequence
MKVSKPATLQ SLFTLGKASL LATVLGFSFG AVAEPVEIKF SHVVAENTPK GQMALKFKEL 
VESRLPGEYK VSVFPNSQLF GDNNELAALL LNDVQLVAPS LSKFERYTKK LQVFDLPFLF
EDMDAVDRFQ QSEAGQQLLN SMSRKGLVGL GYLHNGMKQF SANNALSLPG DAAGKKFRIM
PSDVIAAQFE AVGAIPVKKP FSEVFTLLQT RAIDGQENTW SNIYSKKFYE VQTHITESNH
GVLDYMLVTS ETFWKSLPKD KREIIKQSMD EAVALGNKLA LEKANEDRQL ILDSKRVELV
TLTPEQRQAW VNAMRPVWSQ FEDKIGKDLI EAAESANKP