Gene Oant_4341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOant_4341 
Symbol 
ID5381603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOchrobactrum anthropi ATCC 49188 
KingdomBacteria 
Replicon accessionNC_009668 
Strand
Start bp1770169 
End bp1771179 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content55% 
IMG OID640837030 
ProductApbE family lipoprotein 
Protein accessionYP_001372870 
Protein GI153011656 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCATTG AAGGTGGTCA GCGTATCAGC CGTCGGCTTG TACTTGCGGG TGGTGCGGCT 
GGAGTAGTGT TCTGGGCCAT TCCAAAACAC GCAGTTTCAC TCGACCTTCC GGAGCCCATC
ATCTGGCGTA GTCAGGCGAT GGGGGCACCT GCCAAGATCA TACTCTATCA TCCGGATCGT
TCGACCGCTG AACGGTTGCT GCGCGAAGCG GCTCAAGAGG CCAAAAGGCT GGAGAACATC
TTCAGCCTCT ATCGCGAAGA TTCAGAACTT GCCCAACTTA ATCGCGATGG GGCGCTGGCA
TCGCCTTCGC CTGATCTGGT TGAGGTTCTG CGTATCTGCC ATGAATGCTG GCAGGCCAGT
GATGGCCTTT TCGATCCGAC CGTACAGCCT CTTTGGAACT GTCTGAAAAA GCATTTCTCG
CAAGAACATC CTTCTCCCGA CGGACCATCG CGACAGCTTT GGGATGAAGC GCTGGCGAAG
GTAGGGTTCG GTTATGTTCT GTTCGATGAT AACCGCATTG CATTCTCCAA GCCATCTATG
TCTCTGACTT TAAATGGTAT TGCGCAGGGC TATGTCACCG ACCGCGTGAC GGCATTGTTG
CAAAGGGCAG GAGTCGAATA TGCACTGGTC GATATGGGGG AGTATCGCGC TCTTGGTTCA
AGAGCTGACG GCACGGCATG GTGCATTGGC ATCGCCGATC TGGAAGCCGG AGCTGCTGCC
GAAGAGTATG TTGATATCCG CAATCAGGCG CTCGCGACAT CCAGCTTCAC GGGCTTTCAG
TTCGATGAGT CCGGACGGTT TAACCATTTG CTCAACCCAA AAACTGGCTT TTCAGCGGCA
CTTTATCGCC GGGTGACTGT CGTGGCCCGC GATGCGGCGC GAGCCGATGC CTGGGCAACT
GCGTTCAGCC TCATGGACAA GAATCAGATC GAAGCAGTTA TCGGTAGCCA ACAGGATATG
TCTGTGATTG CACAGACACG CTCTGGCGAA CGAATAGGCC TGGGTTCTTA G
 
Protein sequence
MRIEGGQRIS RRLVLAGGAA GVVFWAIPKH AVSLDLPEPI IWRSQAMGAP AKIILYHPDR 
STAERLLREA AQEAKRLENI FSLYREDSEL AQLNRDGALA SPSPDLVEVL RICHECWQAS
DGLFDPTVQP LWNCLKKHFS QEHPSPDGPS RQLWDEALAK VGFGYVLFDD NRIAFSKPSM
SLTLNGIAQG YVTDRVTALL QRAGVEYALV DMGEYRALGS RADGTAWCIG IADLEAGAAA
EEYVDIRNQA LATSSFTGFQ FDESGRFNHL LNPKTGFSAA LYRRVTVVAR DAARADAWAT
AFSLMDKNQI EAVIGSQQDM SVIAQTRSGE RIGLGS