Gene Oant_2089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOant_2089 
Symbol 
ID5379662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOchrobactrum anthropi ATCC 49188 
KingdomBacteria 
Replicon accessionNC_009667 
Strand
Start bp2185870 
End bp2187372 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content56% 
IMG OID640834758 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_001370634 
Protein GI153009419 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACGAGT TGCTGACCCC GCAGGAAATG GCAGAAGCGG ATCGCCAGAC GATCGAGACC 
GGTATCAAGG ATGGTTTCTC ACTGATGCTT GCTGCAGGCC GGGCCGTCGC AGAGGTTGCA
CAGCGGATGT TTCCTCGCAA TGCTCCAGTT GCTGTTCTTT GCGGGCCTGG CAACAATGGT
GGCGACGGCT ACATCGCGGC GCAGTTTCTC CTGGAAGCGG GTTTGGAAGT CGTGTGCTTC
GCATCCGCTC CCCCGAGGAA AGGCTCCGAT GCCATGCGTG CGTCGTTGTT CTACAAGGGG
CCTGTCTCGA ACTTCGGTGA ATTTTCGGTT GCGAGTTTCG GCGGCGTAAT CGATGCATTA
TTCGGCGCTG GCCTCGCTCG CGATGTTCAC GGAGCCGAAG CCATCGCGAT TGATGCTGTG
AACACTTCTG ATGTGCCGGT TGTGGCGGTT GATTTACCTA GCGGCATATC CGGTGAAAGC
GGTCATATCC TCGGAACGGC GATCCGTGCA CGAGCAACAG TTACGTTCTT TCGCAAGAAA
CCCGGCCATT TGCTACAACC GGGAAGGGCG CATTGCGGTG TTCTCCATGT GGCGGATATA
GGCATTCCGG ATCACATACT TACTGCGATC GAACCGCATG CATTCGAGAA TGCTCCAGAC
CTTTGGGCAA GCGCCCTGCC AGTATCTGGC ATTGAGGCTC ATAAGTATAG TCGCGGCCAT
GCAGCCGTGT TTTCCGGCGG CGTTCATTCT ACAGGTGCCG CACGCTTATC AGCCCTCGCT
GCGGCCCGTA GTGGGGCGGG TGCGGTAACG CTTCTGTCTC CGCCCGATGC TATGACGGTC
AATGCCACGC ATCTGACCAG TATCATGGTC CGTGAGACCC GTGGTTTGAG GGATATTCGA
CAGTTTTTTG CCGAACGTAA GGTCGCATCA GCTGTCCTTG GTCCGGGTTA TGGAAATCCC
GTTTTGGCGC GTGAATATGC CCTGTTTCTA GCAAGTGACG AATCAGCGGG CTTTCATGGT
TTGGTTCTTG ATGCCGACGG GATTACGGCC TTTGAGCAGA ATCCAGCTGA GCTTTTCGAT
AAGCGACAGG GCATAAAGAC TGCTCTTGTG CTTACACCAC ATGAGGGAGA GTTCCGGCGG
CTTTTTCCAG ATATAGCAAA GAGTTCGACA TCAAAGATCG AAAAGGCTCG CGAGGCTGCA
AGGCGTGCTA ACGCAGTCCT TATATATAAA GGTCCCGATA CCGTGATCGC AGAACCGGGT
GGTCTGGCAG TGGTCAATGC CAATGGAACC CCGCTGCTCG CCACTGCAGG ATCAGGCGAT
GTCTTGACGG GAATTGTGTG CGGCCTGCTT GCTCAAAGGA TGCCCGCATT TGCAGCGGCG
TGTGCTTCCG TGTGGATGCA TGCGGACGCC GCGCGACGCT TTGGCCATGG ACTTATTGCG
GAAGATTTGC CTGCTCAGAT ACCTGCTGCC CTGTCGGCAT TGGTGCGATC CTCTCGCAAT
TAA
 
Protein sequence
MYELLTPQEM AEADRQTIET GIKDGFSLML AAGRAVAEVA QRMFPRNAPV AVLCGPGNNG 
GDGYIAAQFL LEAGLEVVCF ASAPPRKGSD AMRASLFYKG PVSNFGEFSV ASFGGVIDAL
FGAGLARDVH GAEAIAIDAV NTSDVPVVAV DLPSGISGES GHILGTAIRA RATVTFFRKK
PGHLLQPGRA HCGVLHVADI GIPDHILTAI EPHAFENAPD LWASALPVSG IEAHKYSRGH
AAVFSGGVHS TGAARLSALA AARSGAGAVT LLSPPDAMTV NATHLTSIMV RETRGLRDIR
QFFAERKVAS AVLGPGYGNP VLAREYALFL ASDESAGFHG LVLDADGITA FEQNPAELFD
KRQGIKTALV LTPHEGEFRR LFPDIAKSST SKIEKAREAA RRANAVLIYK GPDTVIAEPG
GLAVVNANGT PLLATAGSGD VLTGIVCGLL AQRMPAFAAA CASVWMHADA ARRFGHGLIA
EDLPAQIPAA LSALVRSSRN