Gene Oant_3066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOant_3066 
Symbol 
ID5381985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOchrobactrum anthropi ATCC 49188 
KingdomBacteria 
Replicon accessionNC_009668 
Strand
Start bp378539 
End bp379885 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content57% 
IMG OID640835743 
Producttype III effector Hrp-dependent outers 
Protein accessionYP_001371603 
Protein GI153010389 
COG category[S] Function unknown 
COG ID[COG3395] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.519748 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACAGG CTTCCAACGA ATCGCTGCTG TCTTATTACG GCGATGATCT GACCGGCTCG 
ACCGATGTTA TGGAAGCCAT GGCATCCAAT GGCGTCGACA CTGTGCTGTT CATGAATGTG
CCGGATGAAG AGCTTCTTTC ACGCTTTTCG CACTGCAAGG CTATCGGTCT TGCTGGAACC
AGCCGCAGCG AAACGCCGGA ATGGATGCAG GAAAATCTCA CGCCAGCGTT CAAGTGGCTC
AAAAGCCTGA ATGCTGCGAT TGCGCATTAC AAGGTCTGCT CGACATTCGA TTCCGCACCT
CATGTAGGCA ATATTGGCAA GGCCGTCGAA ATCGGCAAAG CAATCTTCGA TCAGGCCTAT
GTGCCGTTGA TTGTCGGTGC GCCGCAGCTC AAGCGTTACA CATCTTTCGG CAATCTCTTT
GCCGCCTATC AGGGCGAAAC CCATCGTATC GACCGTCATC CCGTCATGAG CCGCCATCCG
GTCACGCCCA TGCATGAAGC AGACCTGCGC GTGCATCTGG CAAAGCAGAC CGCACTGAAA
ACGGTGCTTG CTGATCTCGT AGCGCTTTCA GCCGCAGATG CAAATGATCG CATAAACGCA
ATTGTCAAAG ATGCTGACGG CATGTTGTTG CTTGACGTCG ACAGCCACGA AAGCCAGTTG
CAGGCAGGCG AGCAGCTCTG GCGTTTGCGT TCCCGCGACG GCTGGTTTGT TGCTGGTTCG
TCGGGTGTTG AATATGCGCT TCTGGCCGCA TGGGCCAAGG CCGGACTGAT CGGTGCAAAA
AAAGAGTTTC CGCTTCCGGG CAAAGCCGAC CGCATTGCGG TCGTTTCAGG AAGCGTCTCA
CCGACGACAG AACGTCAAAT TTGTCACGCG ACGGCATCAG GCTTTACCGG CATTGATCTC
AACCCACTCG ATCTGCTGGG TGAGAATGGC AATCAGGCAA TCGAAGCTGC CATTGCTGAC
GGCCAGAAGG CGCTCCAGCA GGGCAACAGC GTTATCCTCA ACACCGCCCT TGGTCCATCG
GCTGACCGAG GTACAGAAAT CGACAAGATT GCAGGCAGCC GCCACAAGCT GGCCCGCAGC
CTCGGCCTCA TCCTCCGCAG CCTTGTCGAG CGCGAAAGGC TTACCCGCGC AGTCATTGCA
GGCGGCGACA CTTCGAGCCA CGCATTGCGT GAATTGCATG TCGATGCACT CACCACGCTT
TTGCCACTTC CACAAACACC GGGGTCGCCG CTTTGCACGG CGCACGGCGC TCACGCCGCC
ACCAACGGCC TCCAGATTGC CCTCAAGGGC GGTCAGGTCG GTTCTGACGG TTATTTCAGT
CAGATTCGCG ACGGCCTGAC CGCTTGA
 
Protein sequence
MSQASNESLL SYYGDDLTGS TDVMEAMASN GVDTVLFMNV PDEELLSRFS HCKAIGLAGT 
SRSETPEWMQ ENLTPAFKWL KSLNAAIAHY KVCSTFDSAP HVGNIGKAVE IGKAIFDQAY
VPLIVGAPQL KRYTSFGNLF AAYQGETHRI DRHPVMSRHP VTPMHEADLR VHLAKQTALK
TVLADLVALS AADANDRINA IVKDADGMLL LDVDSHESQL QAGEQLWRLR SRDGWFVAGS
SGVEYALLAA WAKAGLIGAK KEFPLPGKAD RIAVVSGSVS PTTERQICHA TASGFTGIDL
NPLDLLGENG NQAIEAAIAD GQKALQQGNS VILNTALGPS ADRGTEIDKI AGSRHKLARS
LGLILRSLVE RERLTRAVIA GGDTSSHALR ELHVDALTTL LPLPQTPGSP LCTAHGAHAA
TNGLQIALKG GQVGSDGYFS QIRDGLTA