Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Oant_4768 |
Symbol | |
ID | 5383168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ochrobactrum anthropi ATCC 49188 |
Kingdom | Bacteria |
Replicon accession | NC_009670 |
Strand | + |
Start bp | 37442 |
End bp | 40432 |
Gene Length | 2991 bp |
Protein Length | 996 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640837345 |
Product | hypothetical protein |
Protein accession | YP_001373185 |
Protein GI | 153011973 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.439128 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGTTTT TTCTCGGAGC CTTTGAGAGT GAATGGGAGC AGCATCGCGC GGCGCTGTTG CATGAGCTTT CTTTGGGTTC CAGTGCTGCG AGTGGCGCGG AAGAAGAAAT GCGCCGGAGG CTGCGCGAAC TGCAAATCGC AGGAGGTGTC GCAGCAGCAG GTAGTGCTTC AGTGCCGAAA ATGCGTACGG GATCCTCAAG GGCACAGGGT TTCCAGATCT TTGGAAGTAC GGCAAGCAAG CCAATTGCGG CAGCTAGGCC GATGGAAGCG CGTCTGGCCG CGGTCGCAGC AGGCAGTCAA CCGTCCGTGG TAAAAATGGC ATCGTACGGT GGCGGTGCGC GGTTCGGCGC GATGGTCAAT TACGTATCTC GCAGCGGGGA AATACACGTC GAAAAGGAAA ATGGTGAGCG CCTTCATGGT CGCGAGGATC TAGCCCGGTT GCGGGGAGAT TGGGAGCCAT TGTTTCAAAA TCGTGCCGAG AGCAGGGATA TTGGAACTTT CCATGTGACG ATCGAAGCTA CTGGTTTCAA AACAGAAGAA GCTTTGCACC AGCTCGTACG CGAAAGCCTG GCGAGCGGGT TAAGTAATCG CAGCTTCGCT TATGCAGTTT CGCAGCCATC ACATGGAGTT TTGAAAGTTG ACGGTCTGGT GGTCTTGCGC AGCGCGGAAG GGGAGCGGTT GACCGGCGAC CCGAAGGCGG CTGACATTAT TCAGAGACGT TACAACGAAA GCGCGGTGTC CGAGCTGGCG AGGGCGCGGT TCCGTTTTAC GGGGTATGGT AATGGCGTGG AATATGGCGC TGCCAAGCTG CGAAATCTGG TCGAGGAACA TAAGGGAGAA GTTCGCGATG ATAAGGGGCG CGAGATCTCC GATGCGCGTT CCGCCGGCGA CCTGGTGCAA AAGCAGTGGC GTCGGGAGTT GCATAGCCGC AAGGGCAGGG ATGTGGCGCA TGTCATCATG TCTGCCAAAG CAGGCACCAA TGTTCAGGCT TTCGAAGGCG CTGTTCGGGA TTTTCTGGCC GCTCAGTTCG AAGGACACCG CTACATATTT GCGATGCATG ATCCGTTCAG CGATCCGAAG GAGATGGGGC AGGGCGGAAA GCGACCGCAT ATCCATGCCC ATGCCATTAT CGCTATGCGT TCGGATGCCG GTGATCGGGT AGAAACCTCG CCACAGGTTT TTCGCCAATG GCGCGAACTT ATGGCTGAAA AGGCGCGCCA GCACGGAATC GCTATGGAAA TGACGGACAG GCGCGAGTTT GCAAATGCTC CAGCCTACAC GCGCAATCAG GTGCGGCCGG TTAATCGCGA GGGACGAACG GAACATGAGG GGACGAGTGC GGCGGCTCAG TCGCGTTATG ATGCCAAGCG GTCAGGTCGG CGGAGCGTGG CGCGAAGCGT GCGCAGCCTT GAATATACCG TAAAGGTAAC TCAGAGTTGG GAGAAAATTG CAATGGCAAG CGGCAATCAT CAGATTGCCT CATACGCCAT CCAGCAAAGA GATATAGTTG TATCAACCAG TTCGCATCAA TCGGAAGCTA AAACTTCTAA CATCGTTCAT GCAGAATTCG GGTCACATTA CCGCACCAAT TTGGTAAAGT TGCAAAACAT CATTTTGGAG GGTGAAAAAG TGCGCGAAAT GTCGAGAAGC GATTTTGAGG CATATGAGAA AGAAGTTGAA ACAGCTTTGT TCCGGCTTGG CCGCAATCTC GGACCGGAAG AGAAAGCCGA TCTGGATCAG GTAGCGCAGT ATACGCGCGA GCATGTCAAT CTAATGCGCG AGCATATGGA ATTGACTGAA CAGCGTGGAT TGACCGTTGA AACCCCATCT CGTGAAAGCG AGCAAGATAG CTCGTCAATG CAGATCGCTC GCGAAGCCGA TTCCGAAATT GCCAAGCCAG CGCATGTTAT CGTGGAAGCA GAGCAGGGTC GGCCATCGGC AGATAATGCC AGCGAACCTC GTTCAAGCAT GGATGACGAG GAGAAAGCAG CTGCAGCATC CTATCAGGCC GAAATGGATC GTTCATTCCC GGATGAGATC ACGAGACGAT ATTATATTCG TGAGGATCAC GGGGGGACGC AGCGAGTCTT TGTCGATTCA AAGGGTGAGC GCGAAGTGTT CCAGGACAAT GGAGAGAAGC TGCGCGCTAA ATCGTTCGAT GCTCAGGGCG TTCGGTTGAT GATCGAAACA GCAGCGCATC GAGGCTGGAC GAGCATCGAA ATAACGGGGA GCAAGGAGTT CCGCCGTGAA ACTTGGTTGG AGGGCCAGGC GCACGGTATT TCGGTCAAGG GCTATCAGCC GACGGAACTG GATTGGCAAG ATCTGGCGCG TCGCGAGCAG TCCTACTTGC GCAATGAAAT TGTTCCGATC GAGGGCAGGG CGCTGGACGC GGCTCACCGG GATCAGGAGC AATCTGCAGG ATCGGATCAG TCGAGCAAGG TCGATAGAGA GCCACAGAAT GCAGATAATT CTGGAACGTC ACACTCGAAG ACCGTCGACT ATAAGGAGGG TGTGCAGGGC ATCCTCGTTG AGACAGGCGA AAAGCCTTAT CAGGATAATG AAAAAAATGA GCCTTCGCCA TTTGTCGTTA TTGAAACGGC CAATGGCAAT CGCACGGTAT GGGGTGTCGG ATTACCGGAT GCGTTGCATC GCGCCGGCGC TGAAATCGGC GATGAAATTC ATTTGCGGTC CACTGGGACG GAACGCGTTT TGAAAACCGT CATTCAGGAA GTCGATGGTC AGAAGCAGCG TGTCGAGCAA ATGGTCGATC GCCGGGCATG GGAAGCGAAC GTGCTTGAAG AGCGGGATCG GACAGATGGA AAAGTTGAAG GTTCCAAACA GCTCGACAAT CAGATTAATA TGGAAGTGAA GGGCGTTGCT CGCGATGGAG AAACCCTGCG CACCGATCCA CCCCAGCAAC AAGTTCCGCG ATTGCAGGAA CTAGAACAAG AGCAGCAGCA AAAGAAAGAA CGCGACGAAC ACGAGCGTTA A
|
Protein sequence | MEFFLGAFES EWEQHRAALL HELSLGSSAA SGAEEEMRRR LRELQIAGGV AAAGSASVPK MRTGSSRAQG FQIFGSTASK PIAAARPMEA RLAAVAAGSQ PSVVKMASYG GGARFGAMVN YVSRSGEIHV EKENGERLHG REDLARLRGD WEPLFQNRAE SRDIGTFHVT IEATGFKTEE ALHQLVRESL ASGLSNRSFA YAVSQPSHGV LKVDGLVVLR SAEGERLTGD PKAADIIQRR YNESAVSELA RARFRFTGYG NGVEYGAAKL RNLVEEHKGE VRDDKGREIS DARSAGDLVQ KQWRRELHSR KGRDVAHVIM SAKAGTNVQA FEGAVRDFLA AQFEGHRYIF AMHDPFSDPK EMGQGGKRPH IHAHAIIAMR SDAGDRVETS PQVFRQWREL MAEKARQHGI AMEMTDRREF ANAPAYTRNQ VRPVNREGRT EHEGTSAAAQ SRYDAKRSGR RSVARSVRSL EYTVKVTQSW EKIAMASGNH QIASYAIQQR DIVVSTSSHQ SEAKTSNIVH AEFGSHYRTN LVKLQNIILE GEKVREMSRS DFEAYEKEVE TALFRLGRNL GPEEKADLDQ VAQYTREHVN LMREHMELTE QRGLTVETPS RESEQDSSSM QIAREADSEI AKPAHVIVEA EQGRPSADNA SEPRSSMDDE EKAAAASYQA EMDRSFPDEI TRRYYIREDH GGTQRVFVDS KGEREVFQDN GEKLRAKSFD AQGVRLMIET AAHRGWTSIE ITGSKEFRRE TWLEGQAHGI SVKGYQPTEL DWQDLARREQ SYLRNEIVPI EGRALDAAHR DQEQSAGSDQ SSKVDREPQN ADNSGTSHSK TVDYKEGVQG ILVETGEKPY QDNEKNEPSP FVVIETANGN RTVWGVGLPD ALHRAGAEIG DEIHLRSTGT ERVLKTVIQE VDGQKQRVEQ MVDRRAWEAN VLEERDRTDG KVEGSKQLDN QINMEVKGVA RDGETLRTDP PQQQVPRLQE LEQEQQQKKE RDEHER
|
| |