Gene Oant_3039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOant_3039 
Symbol 
ID5381397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOchrobactrum anthropi ATCC 49188 
KingdomBacteria 
Replicon accessionNC_009668 
Strand
Start bp346916 
End bp348271 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content52% 
IMG OID640835716 
Productpeptidase M16 domain-containing protein 
Protein accessionYP_001371576 
Protein GI153010362 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0296274 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGAACA TTCTTACGTT GAACTTCAGA CGTGTTTTCG ACTTTCGCTA TGCCGGGTCG 
GCTGTGGGTG TTCTTGCCGC ATCCGTGACA TTGTTGATCG TGTGCGCCTT GCCGGCGCGC
GCAATCGAAA TTCAGGAAGT TGTTTCGCCA AAAGGTATTC ATGCCTGGCT GGTAGAAGAC
GATTCAGTCC CGCTTATTTC CATGCGGTTC TCATTCAAGG GTGGCACATC GCAGGATCCT
TCCGGGAAGG AAGGGTTGGC TAACCTGATG ACAGGGCTGT TCGATGAGGG GGCGGGCGAC
CTGAAATCGG ATGCGTTTCA GGAGAAGATA GACAATCTGG GCGCGGAAAT GAGCTTTTCT
GCAACACAGG ATTCCGTTTC GGGCGGTATT CGCATGCTCG CGGAAAATCG CGACGCAGCC
ACAAACTTGC TTGCTCTTTC TGTCAACAAG CCTCGCTTCG ACCAAGATGC TATTGATCGT
ATTCGACAGC AGGTGGTTGC GAGTATCGAA TCCTCACAAC GCAACCCTTC GACGATTGCA
TCGCGTAAGT TCTCCGAAGT TCTCTATGGG AACCATCCTT ATGCACGTCC CGATGATGGC
ACGGTGAAAT CACTGCAGTC GATCACGCGT GACGATCTGG TGAACTTTCA CCGCAAGAAC
TTTGCGCGTG ACCGTCTGAC TATCGGCGTT GTGGGTTCGA TCAATGCGAA GGATTTGGAG
GCGTTGCTGG ATAAGGTGTT CGGCGATCTC CCCGCGATGG CCGAACTGGT TCCTGTACCC
GATGCCAAAC TGGCACTTGG CACGACGACC AGCCTCAATT TCGATATGCC GCAGACCTCG
ATCAGCTTTG TCTATCCGGC TATTCCGCGC AAGGATCCCG AATTCTTTGC AGCCTATCTG
ATGAACCATA TTCTGGGTGG TGGCTTTACA TCGCGTCTTT ATGCCGAAGT GCGTGAAAAG
CGTGGCCTTG CCTATTCGGT ATCGTCATCC ATGGTCATGC GTGATCATGT TTCGGCATTG
ATGATTTCGA CTGCGACCCG TCCTGACAAA GCGCAGGAAT CCCTGAAGAT TATCCGCGAG
CAGGTTGCTG CTATGGCCGC GGACGGGCCC ACGGAAGAAG AACTTGCTGC TGCCAAAAAC
TTCCTCAAGG GATCTTACGC CGTCAACAAT CTGGATTCAT CCGCCGCTAT TGCTGAAACT
CTCGTTAGCT TGCAGGAAGC AGAACTTCCG CGTGATTATA TCGACAAGCG CTCGGAGTTG
ATTGATGCTG TAACGTTGGA TCAGGTCAAG GCTATTGCTA AGAAACTGCT TGAAGCGGAA
CCGGCAATAT TGATTTTCGG CCCGGCCCAA AGCTAA
 
Protein sequence
MKNILTLNFR RVFDFRYAGS AVGVLAASVT LLIVCALPAR AIEIQEVVSP KGIHAWLVED 
DSVPLISMRF SFKGGTSQDP SGKEGLANLM TGLFDEGAGD LKSDAFQEKI DNLGAEMSFS
ATQDSVSGGI RMLAENRDAA TNLLALSVNK PRFDQDAIDR IRQQVVASIE SSQRNPSTIA
SRKFSEVLYG NHPYARPDDG TVKSLQSITR DDLVNFHRKN FARDRLTIGV VGSINAKDLE
ALLDKVFGDL PAMAELVPVP DAKLALGTTT SLNFDMPQTS ISFVYPAIPR KDPEFFAAYL
MNHILGGGFT SRLYAEVREK RGLAYSVSSS MVMRDHVSAL MISTATRPDK AQESLKIIRE
QVAAMAADGP TEEELAAAKN FLKGSYAVNN LDSSAAIAET LVSLQEAELP RDYIDKRSEL
IDAVTLDQVK AIAKKLLEAE PAILIFGPAQ S