Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_1626 |
Symbol | |
ID | 4284596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 1781491 |
End bp | 1782921 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638141113 |
Product | XRE family transcriptional regulator |
Protein accession | YP_756856 |
Protein GI | 114570176 |
COG category | [R] General function prediction only |
COG ID | [COG3800] Predicted transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.928013 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.297966 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACA ACCAGGCTTT CGAGAAGATC TTCGCCGGCG CCCGCCTGCG CCGCCTCAGG CGCGAACGTG GCATGACCCA GGCCGAGGCG GCCGAAGCCC TGGGCCTGTC GGCCAGCTAT CTCAACTTGC TGGAACGCAA TCAGCGCCCG GTGACCGCGC GGGTGCTGCT GGCCCTGGCC GAAGCCTTCG ACGTGGATGT CCGCAGCTTT GCCAATGAGA GCGATCGGCA ATTGATCGCC GACCTCACCG AGGCGGCTGC CGACCCGGTC CTGGCCGGTC TCGAGCTCGA CCGCGTGGAA CTCAACGAGC TCGCCGACAG CCAGCCGCGC GCCGCCGAGG CCCTGACCCG CCTGTTCCAG TCCTATCGGG AAATGGCCAC AGCCACGGCC GATCTCGCCA CGCGCATGTC CGGCCCCGGC GCGACAACGG GCGGCCCCGG TGTGGTGCTG GAATCCGTCC GCAACGCCAT CGACGCTCAC CACAATCACT TCCCCGACCT CGAGGAAGCC GCCGAGGCCC TGAGCGACCG GGCCGGTCTG CGCAGTCGCA ATCGCGACCA GGCGCTGGCA GCCTATCTGC AGGACCAGCA CGGCTTTACC GTTCGCGTGC TGGACGAGGA CGTGATGGCC GGCGCCCGCC GCCGCCTGGA CTTTCACGGT CGCCGCCTGC TGCTGTCGGA GACCCTGCCC CCCGCCTCAC GCGGCTTTCA CATGGCGGTC GTGCTGGCGA GCCTGGAACA GGCCGATCTT CTCGACAGCC TGTGTGACCA GATCGATCTG CCCAGCGCCG AGGGACGGCG CCTGCTCAGG ATCGGTCTCG CCAATTATTT CGCCGGCGCG GTGCAGATGC CCTATGCCGC TTTCCACAGG GCCGCCGAGA CCAATCGCTA TCATCTCGGC GTTCTGCAAC GCCGTTTCGA GGCCAGCTAT GAACAGGTCT GCCATCGCCT GACCACGCTG CAGCGACCGG GCGCGCGCGG CCTGCCCTTC TTCATGATCC GGGTCGATGC GGCGGGCAAT GTCTCCAAGC GCTTCGGCGG CGGCATCATG CCCTTTGCCC GCGCCGGCGG CGGCTGTCCG AAATGGAATC TCTACGACGC GCTGCGGATG CCCGAGCGGA TCCTGACCCA GTCCTTCGAA CTGCCCGACG GCACCCGCAT GCTGTCGCTC GCCCGTGGCC AGTCAACGCA AGGCCCCACA GGACAGCCGC CCGTCCTGCA CGCGATCGCC CTGGGCTGTG ACTGGGACAA TGCCGGCAAG ATCGCCCATG CCGACGGGAT GAGTGACGCC AACCCGGCCG CGATCGGTCT CGCCTGCCGC CTGTGTGACC GCGAAGACTG CGCCCAGCGC GCCTTCCCGC CCCTCAACCG CAAGCTGACA ATGGACCCGC ACCAGCTGCG GGCCTCGCCT TATGCGTTCG GGGAGAGTTG A
|
Protein sequence | MSDNQAFEKI FAGARLRRLR RERGMTQAEA AEALGLSASY LNLLERNQRP VTARVLLALA EAFDVDVRSF ANESDRQLIA DLTEAAADPV LAGLELDRVE LNELADSQPR AAEALTRLFQ SYREMATATA DLATRMSGPG ATTGGPGVVL ESVRNAIDAH HNHFPDLEEA AEALSDRAGL RSRNRDQALA AYLQDQHGFT VRVLDEDVMA GARRRLDFHG RRLLLSETLP PASRGFHMAV VLASLEQADL LDSLCDQIDL PSAEGRRLLR IGLANYFAGA VQMPYAAFHR AAETNRYHLG VLQRRFEASY EQVCHRLTTL QRPGARGLPF FMIRVDAAGN VSKRFGGGIM PFARAGGGCP KWNLYDALRM PERILTQSFE LPDGTRMLSL ARGQSTQGPT GQPPVLHAIA LGCDWDNAGK IAHADGMSDA NPAAIGLACR LCDREDCAQR AFPPLNRKLT MDPHQLRASP YAFGES
|
| |