Gene Oant_3787 details


Gene Information

Locus tag: Oant_3787
Symbol:
ID: 5381851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ochrobactrum anthropi ATCC 49188
Kingdom: Bacteria
Replicon accession: NC_009668
Strand:
Start bp: 1176873
End bp: 1178468
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 54%
IMG OID: 640836473
Product: extracellular solute-binding protein
Protein accession: YP_001372322
Protein GI: 153011108
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
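
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the record above.
START_BP = 1176873
END_BP = 1178468
GENE_LENGTH_BP = 1596
PROTEIN_LENGTH_AA = 531


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the reported gene length (1596 bp) agrees with the coordinates, and the reported protein length (531 aa) agrees with 532 codons including the stop.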


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.216169
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAAAT TTGCATTGCT GGCAGCGCTG TCCGCAGTCC TGCTGGCCGG AAGGGCGGCT 
TACGCGGCGG ACGAACCACG TTCGGGAGGC GTCATCAATT TTGTCGCCCC CTACGGCGAC
AGCTTCTCGA CGCTCGACAT TCAAGCATCG CCTGCCACAC AAGACGAGTT CTACGCCAAG
GCGATTCACC GTACGCTCTA TGATTGGGAT GCTGATCTCA ACAAACCGGT ACTCGGTCTT
GCAACCGATG TGACGGTGTC CGACGACAGG CTCGTCTATA CCTACAAGCT TCGTCAGGAT
GCATTTTTCC ACAATGGCAA GCCTTTGACG GCAGACGACA TTATCTGGAG CTACACCCGG
ATCATGGATC CAAAAAAGGC ATTCCCACCG GCACGTTACA TCGCCGAAAT CAAAGGGGCG
GAGGAATATA CGCAGGGCAA GGCCAAGGAA ATTTCCGGCC TGAAGAAGAT CGATGACCAT
ACGCTCGAAA TCACGCTGAA ACAGCCAATT GATCCGGGCT TCCAGTTCAT GCGTAACAAC
ACTGCCATTT ACCCGGCAGG TGAGGGGGAT AGCGAAGAGT TTCAGCGCCA CCCGATTGGT
CTCGGACCTT ATAAGTTTGC GGAATATGTT CCCGGCTCTC GCCTGACAGT GGAGAAGTGG
GACAAGTACT ACGAAAAGGG CAAGCCCTAC GCCGACAAGA TCAATATCAT GATCATGGGT
GATGCGGCCG CGCGCGACGT TGCGTTCCGC AACAAGGAGA TCGACGTAGC CGTGCTCGGG
CCGGCACAAT ATACCGCCTA TCTTGCCGAT CCTGAACTGG CAAAGAATAT GGTCGAGGTA
GCTGAGGTAT ATACGCGCGC CGTCGGCTTT AATCCGGATT TCAAGCCGTT CCAGGACAAG
CGCGTTCGTC AGGCAATCAA CTACGCCATC GATAGCGATC TCATCATCAA GCGTCTCGTC
AAAGACAAGG CCTATCGTGC TGTTGGTTGG CTGCCAAACT CATCGCCAGC CTTCGACAAG
GATGCGAAGC CTTATCCGTA TGACCCTGAA AAAGCAAAAG CTTTGCTTGC TGAAGCTGGG
TATCCCGACG GCTTCGAATT CGAATTGACA GCAACGCAAA ACGAGAGTTG GGGCCTGACC
ATCGTTCAAG CGATCATTCC GATGCTCGCC AAGGTCGGTA TCAAGGTGAA GGCAAAGCCG
GTCGAAGCGT CAGTGCTTGC GGATGTAGTT CCGGCTGGCA ATTTCCAGGC CTATATGTGG
TCGCTCGAAA GCGGTCCGGA TGCACTGACG GCGATGCAGT GCTTCTATTC CACGACCCCA
CAATCTGCGT GTAACTACCA GAAGTTCTCG AACGCAGAAT TCGACAAGAT TGTCGATGAG
GCAAAGGTGG CGAAAACCGA AGAAGAGAAG AACGAGCTTC TGAAGAAGGC CAACAACCTG
TTGCAGGAAG AAGCGCCGGT CTGGTTCTTC AACTACAATA AGGCTGTCAT GGCACATCAG
CCATGGTTGC ATGGCCTGCA GCCAAACTCG GCGGAGCTCG CCGTTCAGTC CTATGAAAAG
CTGTGGGTTG ATGACACGGT TCCGTCCGGT CGCTAA
 
Protein sequence
MRKFALLAAL SAVLLAGRAA YAADEPRSGG VINFVAPYGD SFSTLDIQAS PATQDEFYAK 
AIHRTLYDWD ADLNKPVLGL ATDVTVSDDR LVYTYKLRQD AFFHNGKPLT ADDIIWSYTR
IMDPKKAFPP ARYIAEIKGA EEYTQGKAKE ISGLKKIDDH TLEITLKQPI DPGFQFMRNN
TAIYPAGEGD SEEFQRHPIG LGPYKFAEYV PGSRLTVEKW DKYYEKGKPY ADKINIMIMG
DAAARDVAFR NKEIDVAVLG PAQYTAYLAD PELAKNMVEV AEVYTRAVGF NPDFKPFQDK
RVRQAINYAI DSDLIIKRLV KDKAYRAVGW LPNSSPAFDK DAKPYPYDPE KAKALLAEAG
YPDGFEFELT ATQNESWGLT IVQAIIPMLA KVGIKVKAKP VEASVLADVV PAGNFQAYMW
SLESGPDALT AMQCFYSTTP QSACNYQKFS NAEFDKIVDE AKVAKTEEEK NELLKKANNL
LQEEAPVWFF NYNKAVMAHQ PWLHGLQPNS AELAVQSYEK LWVDDTVPSG R
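
The sequences above are printed in blocks of 10 characters, so they need cleaning before any analysis. The sketch below shows one way to strip the spacing, compute GC content, and translate the gene sequence for comparison with the listed protein. It uses only the Python standard library; the helper names (clean, gc_content, translate) are hypothetical, and the codon table is built from the NCBI amino-acid string that translation table 11 shares with the standard code.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (same assignments in tables 1 and 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First block line of the gene sequence, copied from the record.
    first_line = "ATGAGAAAAT TTGCATTGCT GGCAGCGCTG"
    print(translate(clean(first_line)))
```

To check the whole record, paste the full gene sequence into clean(), then compare translate() of the result with the cleaned protein sequence and gc_content() with the reported 54%.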