Gene Information |
Locus tag | Mmar10_1070 |
Symbol | |
ID | 4284479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 1166830 |
End bp | 1168407 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638140542 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_756301 |
Protein GI | 114569621 |
COG category | [E] Amino acid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1174] ABC-type proline/glycine betaine transport systems, permease component; [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) |
TIGRFAM ID | |
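
The length fields above follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon, the arithmetic can be checked like this (the function names are hypothetical, chosen for illustration):

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 1166830
END_BP = 1168407


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1578
print(protein_length(length_bp))  # 525
```

Both values agree with the Gene Length and Protein Length fields of the record.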
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.019749 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCAGCA ATCTCGCCGA GCGCTGGGCT GAACTTCCGG ACCTTCTCGC CGGACACATG CTGTTGTCGC TGGCGGCCAT ACTGGTCGGC CTGTCGATCA GCCTGCCGCT TGGAATTCTT GCGGCGTCCC GACCGCGCCT GGCGGCGGTC GTGCTCAATG CGGCCAGCGT CATCCAGACC ATTCCCGGCC TTGCCCTGCT GGCATTGATG GTACCGCTTC TTGGCGGGAT GATCGGTTAT GCGCCAGCCT TCCTGGCGCT GATGCTCTAT TCGATCCTGC CGATCCTGCG CAACACGATT GTCGGGCTCC AGGGCCTTGA TCCGGTTGTG CGCGAGGCTG CCCGCGGCAT CGGCATGACG CCGCGTGAAC GGCTGTGGCA GGTCGAGCTG CCGCTGGCCC TGCCGGTCAT TGTCGCCGGA TTGCGTACGG CGGTGGTGTG GGTGGTTGGC GCGGCGACGC TGGCAACGCC GGTCGGGGCC TCCAGCCTGG GCAATTACAT CTTTGCCGGC CTGCAGACCC GCAATTGGCT GTCCGTCCTG TTCGGCTGCC TGTTTGCGGC CGGGCTGGCC ATTGCGCTGG ATCAGATCAT CCGCTTGCTG GAAGTCGCCG CCCGGGATCG CCGCAAGGGG TTGGCGGTCG CTGTCGGCGT CGCGCTTTCG GCACTGGTCG TCATCACCAT CGCACCGCGC TATGTGCCGG CCGCTGTGTT CCGAGGGGCG GGGCCGGTCG TGGCGCTTCA GGACGGTGCC GACACCCTTG CCGATCAGAC TGTTGTGGTC GGCGCCAAGG CCTTCACCGA GCAATACATT CTCGGTGCCC TGATGCAGAC GCGTCTGGAA GAGGCCGGGG CCCGCGTCGA CCGGCTCGAC AATCTGGGTT CCACCATCGC CTTCGATGCC CTGCGCACGG GCGAGATCGA CGTCTATATC GACTATACCG GCACAATCTG GGCGACGGTG ATGAACCGTT CCGAGCCGAT CAACCGCAAT GCGATGCTCG CCGAGATGAC CGGCTGGCTG TACGCCGAAC ACGGCATTCT GGCGCTTGGC GGGCTGGGAT TTGAAAACGC CTACGGGTTT GCCATGGCCC GCGACCGGGC CGAAGCGCTG GGAGTCGACA GTCTCGCCGA CCTTGCTTCC CAGGCCGAAG GCATGAGTGT CGGAGGCGAT GCGGAGGTCT TTTCGCGGCC CGAATGGGTG AACACCCGCA ACCGCTACGG CCTCGGGGCG ATGCAAACCC GCGCCATGGA TGCCGCCTTC ATGTATGGCG CTGTGCGGGA CGGCCAGGTC GACTTGATCT CTGCCTATAC GACTGACGGC CGCATTGCGG CGTTTGACCT GCTGGTCCTC GCCGATCCAC TGGCGGTGTT GCCGCCCTAT GACGCGGTGA TCCTTCTGTC GCCCGATGCG GCAGCCAATC CGGCCCTGGT CGAGGCGTTG TCCGGCCTGA TTGGTGCGAT CGACAGTGAA GCGATGCGTG AGGCCAATCG CCTTGTCGAT CTGGATCGTC AGACGCCGGA CCAGGCGGCA ATCTGGCTGT CGGACCATTT GCAGGCGGGT GCGGGAGGCA ATCAATGA
|
Protein sequence | MSSNLAERWA ELPDLLAGHM LLSLAAILVG LSISLPLGIL AASRPRLAAV VLNAASVIQT IPGLALLALM VPLLGGMIGY APAFLALMLY SILPILRNTI VGLQGLDPVV REAARGIGMT PRERLWQVEL PLALPVIVAG LRTAVVWVVG AATLATPVGA SSLGNYIFAG LQTRNWLSVL FGCLFAAGLA IALDQIIRLL EVAARDRRKG LAVAVGVALS ALVVITIAPR YVPAAVFRGA GPVVALQDGA DTLADQTVVV GAKAFTEQYI LGALMQTRLE EAGARVDRLD NLGSTIAFDA LRTGEIDVYI DYTGTIWATV MNRSEPINRN AMLAEMTGWL YAEHGILALG GLGFENAYGF AMARDRAEAL GVDSLADLAS QAEGMSVGGD AEVFSRPEWV NTRNRYGLGA MQTRAMDAAF MYGAVRDGQV DLISAYTTDG RIAAFDLLVL ADPLAVLPPY DAVILLSPDA AANPALVEAL SGLIGAIDSE AMREANRLVD LDRQTPDQAA IWLSDHLQAG AGGNQ
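
The protein sequence corresponds to the gene sequence read under translation table 11, where an initiating GTG codon is read as M. The sketch below is an illustration only, not the database's own pipeline. It assumes that table 11 shares its amino acid assignments with the standard code, and the `START_CODONS` set is a deliberately small subset of possible bacterial start codons. The helper names `translate` and `gc_fraction` are hypothetical. Only the first and last blocks of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: amino acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a small subset of start codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS fragment, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # an initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and last two blocks of the gene sequence above.
print(translate("GTGAGCAGCA ATCTCGCCGA GCGCTGGGCT"))  # MSSNLAERWA
print(translate("GCGGGAGGCA ATCAATGA"))               # AGGNQ
```

Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field. The two translated fragments match the start and end of the listed protein sequence.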