Gene Oant_3909 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOant_3909 
Symbol 
ID5381385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOchrobactrum anthropi ATCC 49188 
KingdomBacteria 
Replicon accessionNC_009668 
Strand
Start bp1319601 
End bp1320617 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content58% 
IMG OID640836594 
Productputative simple sugar transport system substrate-binding protein 
Protein accessionYP_001372443 
Protein GI153011229 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.344207 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTCG CAAGAATCCT TTTCGCATCC GCAGCACTGG CAGGCGTTCT CGCCGCTGGC 
AGCGCGATGG CTGACACATC GTCGAAGAAG ATTGCATTCT CCAACAATTA TGCTGGCAAC
TCATGGCGCC AGGCCATGCT GCAAAGCTGG GACAAGATCA CCAAGGAAGC CGTGAAGGCC
GGTGTGGTCG CTGCGGCTGA CCCGTTCACG ACGGCTGAAA ATCAGGCCAC AGAGCAGGCC
GCACAGATCC AGAACATGAT CCTGCAAGGC TATGATGCCA TCGTCATCAA TGCCGCTTCG
CCAACCGCTT TGAACGGTGC AATCAAGGAA GCCTGCGATG CGGGCATCAC GGTCGTTTCC
TTTGACGGCA CCGTCACGGA GCCTTGCGCA TGGCGCATCG CGGTCGATTT CAAGGCAATG
GGCGAAGGCC AGATCGATTA TCTCGCAAAG CGCTTCCCCG ATGGCGGCAA CCTGCTTGAA
ATTCGCGGTC TTGCCGGTGT TTCGGTCGAT GACAATATCC ATGCTGGCAT CGAAGAAGGC
GTGAAGAAGC ATCCGAAATT CAAGATCGTC GGCTCCGTCA ATGGCGACTG GGCGGCGGAC
GTGGCACAGC GTGCCGTCGC TGGCATCCTC CCAAGCCTGC CGAAAATCGA CGCAGTCGTG
ACGCAGGGCG GCGACGGTTA TGGTGCTGCG CAGGCCTTCG CCGCCGCCAA GCGCGAAACG
CCGATCATCA TCATGGGCAA CCGTGAAGAC GAACTGCAGT GGTGGAAGCA ACAGAAGGAC
GCCAATGGCT ACGAAACCAT GTCGGTTTCG ATTGCACCCG GCGTTTCAAC GCTTGCTTTC
TGGGTTGCCC AGCAGATTCT CGACGGCAAG GACGTGAAGA AGGACCTCGT GGTTCCGTTC
CTCAGCGTCA GCCAGGAATC GCTCGACAAG GATCTGGCCA ACACCCAGAA GGGTGGCGTC
GCCAATGTCG AATATTCGCT GGAAGACGCG CAGAAGGTCA TCGACGCGGC CAAGTAA
 
Protein sequence
MKLARILFAS AALAGVLAAG SAMADTSSKK IAFSNNYAGN SWRQAMLQSW DKITKEAVKA 
GVVAAADPFT TAENQATEQA AQIQNMILQG YDAIVINAAS PTALNGAIKE ACDAGITVVS
FDGTVTEPCA WRIAVDFKAM GEGQIDYLAK RFPDGGNLLE IRGLAGVSVD DNIHAGIEEG
VKKHPKFKIV GSVNGDWAAD VAQRAVAGIL PSLPKIDAVV TQGGDGYGAA QAFAAAKRET
PIIIMGNRED ELQWWKQQKD ANGYETMSVS IAPGVSTLAF WVAQQILDGK DVKKDLVVPF
LSVSQESLDK DLANTQKGGV ANVEYSLEDA QKVIDAAK