Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_1259 |
Symbol | |
ID | 4283800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | - |
Start bp | 1372704 |
End bp | 1374062 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638140738 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_756489 |
Protein GI | 114569809 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.563551 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.00447434 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGAAACA GTTTGATGAC ATTCGCCTCT GCTGCCGCGC TGCTGACCGT GGTGGCCGCG TCTGCCTTCG CCGGGGGCGA CGACCATCCG CGCCGTGGCC TGGCCATCGG CGGCGAATGC GAAGGTTGCG ACTTTTCCGA CAAGAACATG GCCAATGCCG TCTTCATCGG CGCCAATTTC GAAGGGACGA GCTTCTCCCG GTCGGAACTC CATTCAGTCT CGATGATCGA GAGCCGGTTC AACGATACCG ATTTCACCGA CACCATCATG GTCAATTCAC GCTTGACCGG GGTCACCTTT TCACGGGCCA ACCTGTCCGG CGCGACGCTG TCCAACTCGA TCATTGAGCG CGTGAACTTT TCCTCGGCTG ACCTCTCCGA GACCGACTTC ACCGGCTCCC AGATGGTCTT CGTGACCTTT ATCGGCTCGT CGCTGAACGA CACGCTGTTT GCCGACGCCC GGCTGCGCCA GGTCAATTTT CTTCGAGCCG ACCTGAATGA CACCAATTTC CGGAGTGCCG AGCTGCGCCG CGCCAATTTC GCCGAGGTTC ATGGACGAGA TGTCAATTTC CAGGACGCGG ATCTGAGCGG CGCCAATTTC CGGAACGCCG ATCTGCACGA TGTCGACTTC ACCGGAGCGA TATTGCTGGG CGCCAATTTC TCTGGTGCCG AAATCCGGGA AACCACCGGT CTGAACAAGT CCACGCTTGC CGGCGCTTGC GGCAATGACG AAACCTCATT GGGCGAGGAC ATCGCCTTGC CTGCCTGCAG CGGCCAGCCT GTTTCGGCTC CTGGTGAGGT GTCCCTGGAA TATGCCCTTC GCCTGACGGG TTCCGAGCGG GTCGAACGCA TGGTCGAGCT ACAGGCGCTG GAAAATGCCC GCATCGAAAT CCTGTCAGAG ATTGGCGCCG CCCTGCAATT GCACGCGACC GAACTCGACC GGGATGCGAT CCGGTCCGCA GCCAATGAGG CGATGCGGGA GATCGCCCGG TCCGAGCGGG AATTGCGGCG CATCGAGCGA CAGCGCGAGC ACCGCCGCGA AGCTGTCAGC GATGAACGTC GGCGCATCAT CGAGGATGCC CGCCGCGAGG CGATGCAAGC CAATGAGGCT GCCCGCGAAG CGCTCCGCGA GGATAGCCGG GCACGGACCG AATCGACACG GCGGGCAACG GCGCGCATCG AAACCGAACA CGGCACCTAT ATCATCGAAC TTCCCGATAC CGACACCGAC CCGATCTGGG TGGTCGACCA TGACGAAACC CGCCATCACG TCTTCCCGAC GCCGCCCGCA GCACCGGCTC CGCCGCGGCC CGATCCGGCG CATGAGAACC ACGACACGCC GGACCACCCC GAGCCCTGA
|
Protein sequence | MRNSLMTFAS AAALLTVVAA SAFAGGDDHP RRGLAIGGEC EGCDFSDKNM ANAVFIGANF EGTSFSRSEL HSVSMIESRF NDTDFTDTIM VNSRLTGVTF SRANLSGATL SNSIIERVNF SSADLSETDF TGSQMVFVTF IGSSLNDTLF ADARLRQVNF LRADLNDTNF RSAELRRANF AEVHGRDVNF QDADLSGANF RNADLHDVDF TGAILLGANF SGAEIRETTG LNKSTLAGAC GNDETSLGED IALPACSGQP VSAPGEVSLE YALRLTGSER VERMVELQAL ENARIEILSE IGAALQLHAT ELDRDAIRSA ANEAMREIAR SERELRRIER QREHRREAVS DERRRIIEDA RREAMQANEA AREALREDSR ARTESTRRAT ARIETEHGTY IIELPDTDTD PIWVVDHDET RHHVFPTPPA APAPPRPDPA HENHDTPDHP EP
|
| |