Gene Oant_3762 details

Gene Information

Locus tag: Oant_3762
Symbol:
ID: 5382258
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ochrobactrum anthropi ATCC 49188
Kingdom: Bacteria
Replicon accession: NC_009668
Strand:
Start bp: 1149159
End bp: 1150328
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 54%
IMG OID: 640836448
Product: hypothetical protein
Protein accession: YP_001372297
Protein GI: 153011083
COG category: [R] General function prediction only
COG ID: [COG3970] Fumarylacetoacetate (FAA) hydrolase family protein
TIGRFAM ID:
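
The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1149159, 1150328)
print(length_bp)                  # 1170
print(protein_length(length_bp))  # 389
```

Both values match the Gene Length and Protein Length fields of this record.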


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCAAGACA AGCGTGAAGT GAAAGCAACG GCGATTTTGC CCGAGGACAG CCATTCTGCG 
ATCTTGATCG GTCGCGTATG GTCGAAGGCC GAAAACGGCC CTTGTCCAGT ACTGGTGAAA
GACGGTCATG TGCACGATAT ATCCAGTTTG TCCGCTACAG TTTCGGGTCT TCTGGAGCAC
AGCTCACTGT TACGCGAGTT AGATGGCACA GAAAACTTCA CCAATCTCGG CTCTCTGGAC
GATTTTCTTT CCGGTGATCG CGGAGAGCTG CTTGCTCCCA TAGATCTGCA ATCGATCAAA
GCAGCCGGTG TCACCTTTGC TGACAGCATG CTTGAACGCG TGATTGAAGA ACAGGCAAAA
GGCGACCCCT CCCGCGCCCG TGAAATTCGC GAACGTCTGG CCCCTGTCAT TGGCGACAAT
TTGAGAGGCG TCGCTGCTGG TTCCGAAAAG GCGGCGGAGG TTAAAGCGCT CCTTCAAGAA
ATGGGGCTCT GGTCACAATA TCTGGAAGTG GGTATTGGCC CTGATGCAGA GATTTTCACC
AAGTCCCAGC CGATGTCCGC CGTTGGATGC GGCGCAACTG TCGGTATTCT ACCGATTTCA
CAATGGAACA ATCCTGAACC GGAAGTGGTT CTGGTCGTCG CTTCTGATGG ACGCATTGTT
GGAGCAACTC TGGGCAACGA CGTCAATCTC AGGGACGTCG AAGGGCGTTC GGCTCTTTTG
CTCGGCAAAG CCAAGGATAA CAATGCTTCC TGCGCTATCG GTCCCTTTAT TCGCCTGTTC
GATGGCGAAT TCACTATGGA TGTGTTGCGC AAGCTCAAGC TGTCGCTGAC TGTTCACGGA
GCTGACGGTT TCGAGATGAC CGGCGAAAGC CCGATGGAAG CGATTAGTCG CGATCCTGAA
AATCTTGCAT CGCAGATGAT GAACCGCAAT CATCAATACC CAGACGGCGC CGTGTTGTTC
CTCGGCACGA TGTTCGCACC TGTCAAGGAC CGTCGCGGTG CCGGACAAGG TTTCACACAT
GAAGTCGGCG ACCGTGTGGA GATATCAACT CCCAAGCTCG GTTGCCTCGT CAATTGGGTC
GACAGGACCG ACAAATGTCC CGAATGGACA TTCGGCATAA GGGCGCTGAT CCGAAACCTG
CAAGCGCGCA ATCTGCTCGA TAAAATTTGA
 
Protein sequence
MQDKREVKAT AILPEDSHSA ILIGRVWSKA ENGPCPVLVK DGHVHDISSL SATVSGLLEH 
SSLLRELDGT ENFTNLGSLD DFLSGDRGEL LAPIDLQSIK AAGVTFADSM LERVIEEQAK
GDPSRAREIR ERLAPVIGDN LRGVAAGSEK AAEVKALLQE MGLWSQYLEV GIGPDAEIFT
KSQPMSAVGC GATVGILPIS QWNNPEPEVV LVVASDGRIV GATLGNDVNL RDVEGRSALL
LGKAKDNNAS CAIGPFIRLF DGEFTMDVLR KLKLSLTVHG ADGFEMTGES PMEAISRDPE
NLASQMMNRN HQYPDGAVLF LGTMFAPVKD RRGAGQGFTH EVGDRVEIST PKLGCLVNWV
DRTDKCPEWT FGIRALIRNL QARNLLDKI
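
The protein sequence begins with M although the gene sequence begins with GTG. This is expected under translation table 11 (bacterial, archaeal and plant plastid code), where GTG is an accepted start codon and the initiator codon is read as methionine. The sketch below illustrates this; it assumes the input is a complete CDS in the reading frame shown, and the helper name `translate_cds` is hypothetical rather than taken from any library.

```python
BASES = "TCAG"
# Amino acids for table 11 in NCBI order (codon bases iterate T, C, A, G).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCAAGACA AGCGTGAAGT GAAAGCAACG"))  # MQDKREVKAT
```

The output matches the first ten residues of the protein sequence listed above.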