Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_0664 |
Symbol | |
ID | 4285241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 762111 |
End bp | 763379 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638140129 |
Product | AraC family transcriptional regulator |
Protein accession | YP_755895 |
Protein GI | 114569215 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTGGG TTTCAGGTTT CCGGGATTAT CCATTGGAAC AGCTCAAATT GGCGCTTCTT ACCGCCGGGT TCTCGCAATC GCTGCTCGTG GCCGTGATGC TGATTGCCTA TGCGCGCAGC AAGGCGGAAC CCCGCTTCTA CCTGGGCGTC TTCGTCGGTC TTTACGGTTT GACATTCCTG GCCGAAATCA CGTCCATGAC AGGCGTCCTG GCAGCGGCGC CCGGCCTCGC CGTGCCGCTC ATAGCCGCGC CGATGTTGCT GGGCCCGCTT CTTTATCTGT ACATCACCAA AGTCGTTAAT CGCGAGGGTG TTCTTTCGGC CCCGGTCAAC CGGCAACACT TCATTGCGCC CGCTATCGGG GCGGCTTTCA CGTCGGGACT GTTTTTTGTG CCACCCGACG CGTTTGGCAG CTTCATCACT CAGGACGTGG AGCCCACCAC CCCGGTTGAC CTGCTAATCG TGTTGATGCT GGCCGTTGTC TTGCTGGTCA CCTTCATCAT CACCTTGATC TACATCATCA AGGGGTATGC GCTTCTGAAC AGGAGCGCGG CCAGCATTCG CGACGAGTTT TCCAATCTGG ATCGCAAGCG ATTGAGATGG CTGAAAGTGT CGTTGATCAA CATCTCGCTG CTGTGGGTGG TGACGATGGC GGCTGACTGG ACCGACTGGT CGGAGGCATC GGCCGGCTTC ATCGCGCTGA GCGTTTGGGA ATTGCTGAGC TTCTATGTGA TGGCGATCCT CGGACTGCGT CAGGGCGTCA TTTTCGGACC TCAACCCTCG CCGCCCGCGC CTGTAGCGCC TGTGCCGCCG GTTGGGGCCC CGCACGAACC CGATTCCGGT CCGGCCCCCT CGGTTGGCCG GCATGAGGTG GAGCAGAAAT ATTCCAAATC CGGCCTGACC CAGCGGGATA TGGATCGCAT CGCCGCAAAG ATCCGGCAGG CTGTCCATAC CGACGGCATT TATCGCAACA GTGATCTGTC ACTGACCCAG TTGTCGAAGT GCATCGGTGT CCAGGCGCCC TATGTGTCCC AGACACTGAC CCAGCAGATC GGCAGTACGT TCTACGACTT TATCGCCCAG GCCCGTGTCC GCGCGGCGAT GGACATGCTG GCAGATCCCG GCAATTCGGA GTCTGTGCTG TCGATTGCCC TGGCGGTTGG TTTCAATACC AAGTCGACCT TCAATTCTGC GTTCAAACGC GTGAGTGGTA CGACCCCGAC GGAATGGCGG CGAAGTCTCG GTGGGCACCG CAACTCCGCC GCCGGCTGA
|
Protein sequence | MDWVSGFRDY PLEQLKLALL TAGFSQSLLV AVMLIAYARS KAEPRFYLGV FVGLYGLTFL AEITSMTGVL AAAPGLAVPL IAAPMLLGPL LYLYITKVVN REGVLSAPVN RQHFIAPAIG AAFTSGLFFV PPDAFGSFIT QDVEPTTPVD LLIVLMLAVV LLVTFIITLI YIIKGYALLN RSAASIRDEF SNLDRKRLRW LKVSLINISL LWVVTMAADW TDWSEASAGF IALSVWELLS FYVMAILGLR QGVIFGPQPS PPAPVAPVPP VGAPHEPDSG PAPSVGRHEV EQKYSKSGLT QRDMDRIAAK IRQAVHTDGI YRNSDLSLTQ LSKCIGVQAP YVSQTLTQQI GSTFYDFIAQ ARVRAAMDML ADPGNSESVL SIALAVGFNT KSTFNSAFKR VSGTTPTEWR RSLGGHRNSA AG
|
| |