Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_0073 |
Symbol | |
ID | 4283939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | - |
Start bp | 71128 |
End bp | 72828 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638139536 |
Product | hypothetical protein |
Protein accession | YP_755307 |
Protein GI | 114568627 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGGTG ACGTACATTA CGAGGTCTTC TTCAAGAAGA ACCGCAAGGC GAGCTGGGCC CTCCACGAGG CCCGCGATGA CCGCGACCAG GCCATCCGCC TCGCCCATTC GCTCGTCGCC AAGCAAAAGG ACGCCTCGGT CCGCGTGACC AAGGAGACCT TCGACCAGGA ACACCGCAAA TTCCGCTCCG TGCCCGTATT CGAACGCGGT GCCGAGATGA TGGGGGCTGA AAAAGAAAAG ACCGGCGAGG CCCGTCTGCC CTGCCTGACC CCGGACGATC TGGCCAAGCC GCATGCGCGC GACACGATCC GCCGGGTGCT GACCGGCTGG CTGGAGCGCG TTCAGGCCAT CCCGATGGAA TTGCTGCACC GGCCCGATCT GGTCGAAAGC CTCGAAGCCT CCGGAACCGA ATTGCAGCAC GCGGTCCAGA AAGTCGCGAT TGCTTCGGCC AGTGACAGCG ATGCCGGCGT GCACGGCTAT GTCAAACAGC TCAACGAGCT GGTCCAGAAG TCGCTGGCCC GAATTTACAA GGACGGCCGT GACAATCGCC TGCCGGAATA TCCCAAAAAG GCGGACTTCG CCGAGATTGC CGGTGAGATC CACAAGCGGG ACCGGCGCGC CTATTCACTG CGCGCGGCCA TGGCCGACCG CCTGCGCCAC GAAAAGAAAT ATGGCGACAA GCTCGAAGCC CTGCTGGACA TGGGCGACAA TCTGCCTGCC GACGAAGACG CCCGCAGCTT CGCCCTGGAC GAAGTCGACA GCTATATTGC CGAAGTCATC GCCTTTGATG CCGGGCGCGA GGCCCTGTTG GGCAAGTGCA AGGATCTTGG CGAAACCCTC GAGCGGCTGG CCTGCCTGTT CGATGGCGAC CACTCGGCCG ATGCATTGAA CCTCGCTCCC AGTGCCGCCA AACGGCTGGC CCGCAAGATC AAGGGCAAGG AGTTTCCAGC CTGTCGCGCG ACCATCGCCG GCTGCATCCT GAAAGACCTC GAACGCCCCA AACGCCTGCG CCCGAGCAGC GTCCGCGATG AAGTCCGCCT GGCCCGTGAC CTCGCCAGCC GCCTGGTCAT CTGCGCCGAC AGCACCCTGC CCGCCGACGC GCTGATCAAG GCCTTCGCCT CACGCTCGGC GCGACTGCTG CAGCCCGAGA TCATCGATGA ATTGCTGCGT CATTCGCGCG GTGCAGACGA GGAACTCGAC CGGCTGATCG CCCTCGAGGA AAACCTCGTC GGCGAGAGCA ACAAGCAAAA ACTGGCCGGC TATATCCGCT CGACACTGGG CTCCAACCAG GCCGATGCCT GGTATGTGCG CGGTGATGCC AAGCCGCTGG AACGCCTGGC CAAGCTGACA TCGCAGCAGG CAAAAGTGCT CAAGGGCGGA TACCCCGAGC GCGACAAGCT CGAGCTGGCT GCCAGTTTTG ACGCCATGGG CATGAAGGTC GTCGACGACA GCAAGATCCT CAACATGGTC GAGGGCGGCG ACCGTCCGGC GCTCGACAAG GCGACCGGGC TGTTGCGCCT GGCGACCGGC GGCGCGCTGC CGATCGGCAA GTGTTCGGCA GACGCCCAGG CGCGCGCCTT GCGTCATTTG AAGTCGGCTG TCGGTCTAAG CGAGGCCCAG GCCGAGGACG GTCGACCCAA GCTGCGACAA ATCCAGGGCA TGCTCCAGGA ATTGACGATC CTGCAGACAA AGTCCGCCTG A
|
Protein sequence | MAGDVHYEVF FKKNRKASWA LHEARDDRDQ AIRLAHSLVA KQKDASVRVT KETFDQEHRK FRSVPVFERG AEMMGAEKEK TGEARLPCLT PDDLAKPHAR DTIRRVLTGW LERVQAIPME LLHRPDLVES LEASGTELQH AVQKVAIASA SDSDAGVHGY VKQLNELVQK SLARIYKDGR DNRLPEYPKK ADFAEIAGEI HKRDRRAYSL RAAMADRLRH EKKYGDKLEA LLDMGDNLPA DEDARSFALD EVDSYIAEVI AFDAGREALL GKCKDLGETL ERLACLFDGD HSADALNLAP SAAKRLARKI KGKEFPACRA TIAGCILKDL ERPKRLRPSS VRDEVRLARD LASRLVICAD STLPADALIK AFASRSARLL QPEIIDELLR HSRGADEELD RLIALEENLV GESNKQKLAG YIRSTLGSNQ ADAWYVRGDA KPLERLAKLT SQQAKVLKGG YPERDKLELA ASFDAMGMKV VDDSKILNMV EGGDRPALDK ATGLLRLATG GALPIGKCSA DAQARALRHL KSAVGLSEAQ AEDGRPKLRQ IQGMLQELTI LQTKSA
|
| |