Gene Oant_3467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOant_3467 
Symbol 
ID5381447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOchrobactrum anthropi ATCC 49188 
KingdomBacteria 
Replicon accessionNC_009668 
Strand
Start bp825542 
End bp826720 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content55% 
IMG OID640836149 
Productectoine utilization protein EutD 
Protein accessionYP_001372002 
Protein GI153010788 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID[TIGR02993] ectoine utilization protein EutD 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGTGG AACTCAACTT CACCCGCGCG GAATATGATG AGCGGATAGC CAAAACCCGT 
CGCGCCATGG AGAAGGCAGG CTTCGATGTC ATCATCGTCA CAGACCCGTC CAACATACAC
TGGCTTACCG GTTATGATGG CTGGTCCTTC TATGTCCATC AATGCGTTGT GCTTTCGATG
GAAGGCGAAC CGATCTGGTA CGGTCGCGGT CAGGATGGCA ACGGTGCGAA ACGCACCGCG
TGGATAAGCC ATGACAACAT CATCGGCTAT CCTGACCACT ACGTGCAATC GCTGGAACGC
CACCCGATGG ATCTTCTGGC CTCAACCCTT GAGGAAAAGG GTTGGGGTAA CAAGACGATA
GCCGTCGAGT TCGACAATTA CTGGTACACT GCTGCGGCCC ACCATGCTTT GCAGAAGCAT
CTGCCCAATG CACAGTTCAA AGATGCGCAA GGTCTCGTCA ACTGGCAGCG GGCGGTGAAG
AGTTCAACCG AGATTGGCTA TATGCGCAAA GCCGGGCGCA TCGTGGAAGC CATGCACCAG
CGCATCGTCG ATAAAATCGA GCCGGGTATG CGTAAATGCG ATCTGGTTGC AGAAATCTAT
GATGCAGGCA CACGCGGCGT TGACGATTTC GGCGGCGATT ATCCTGCCAT TGTGCCGTTG
CTGCCTTCTG GACCTGACGC CTCCGCCCCG CATCTCACCT GGAACGATTT GCCGATGAAG
ACCGGAGAAG GCACCTTCTT CGAGATTGCA GGTTGCTATA AGCGTTATCA TTGCCCACTG
TCGCGGACTG TATTTCTCGG CAAGCCGACG CAGGCTTTTC TTGATGCAGA AAAAGCAACA
CTCGAAGGTA TGGAAGCGGG ACTTGCTGCT GCTCGTCCCG GCAATACCTG CGAAGATATT
GCCAACGGCT TCTTCGCTGT TTTGAAGAAA TACGGGATCA TCAAGGACAA CCGCACCGGC
TATTCCATCG GCCTGTCCTA TCCGCCGGAT TGGGGCGAAC GCACCATGAG CCTGCGCCCC
GGCGATCACA CCGAATTGCA GCCCGGCATG ACCTTTCACT TCATGACAGG TCTCTGGCTG
GAAACCATGG GGCTGGAGAT CACCGAGAGC ATCGTGATCA CCGAAACCGG CGTCGAATGC
CTGTCGAATG TGCCACGCAA GCTCGTGGTC AAGAATTAG
 
Protein sequence
MSVELNFTRA EYDERIAKTR RAMEKAGFDV IIVTDPSNIH WLTGYDGWSF YVHQCVVLSM 
EGEPIWYGRG QDGNGAKRTA WISHDNIIGY PDHYVQSLER HPMDLLASTL EEKGWGNKTI
AVEFDNYWYT AAAHHALQKH LPNAQFKDAQ GLVNWQRAVK SSTEIGYMRK AGRIVEAMHQ
RIVDKIEPGM RKCDLVAEIY DAGTRGVDDF GGDYPAIVPL LPSGPDASAP HLTWNDLPMK
TGEGTFFEIA GCYKRYHCPL SRTVFLGKPT QAFLDAEKAT LEGMEAGLAA ARPGNTCEDI
ANGFFAVLKK YGIIKDNRTG YSIGLSYPPD WGERTMSLRP GDHTELQPGM TFHFMTGLWL
ETMGLEITES IVITETGVEC LSNVPRKLVV KN