Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MmarC7_1221 |
Symbol | |
ID | 5328369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcus maripaludis C7 |
Kingdom | Archaea |
Replicon accession | NC_009637 |
Strand | + |
Start bp | 1195987 |
End bp | 1197465 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640793774 |
Product | sodium/proline symporter |
Protein accession | YP_001330435 |
Protein GI | 150403141 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family [TIGR02121] sodium/proline symporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATATCAG ATAATTTGAG TATCGTTTTG ATATTCATGC TCTATTTGCT CGTGGTAATG GGCGTAGGTA TGTATTTCTA CAGGCGAAAC GAAACGATAA GCGATTATGT GCTTGGTGGC AGAAAATTAA ATAGCTGGGT TGCAGCGTTA AGTGCACAAG CCTCAGACAT GAGCGGTTGG CTTTTAATGG GTCTTCCAGG AGTTGCATAT CTTTCTGGAA TGAGTGAAAT ATGGATTGGC GTAGGTCTTG CAATAGGAAC TTACCTAAAC TGGAAGTTCG TTGCAGAACG GCTTAGAAGA TACACAGAAA TTGCAAAAGA TTCTATTACA ATACCTGTTT ACTTGGAAAA CAGATTTAGA GATCAGTCTA AATTATTAAG AATTGTTTCA GCGTTTTTTA TTATGCTATT TTTCTTACTG TACACGTCTT CAGGATTAGT CGCAGGCGGA AAGTTGTTCA ACTTAGTTTT TGGAGTAGAT TATACTCTTG CAGTTACTAT CGGGGCGTTA GTAATTATTG GTTATACATT CCTTGGTGGT TTCCTTGCAG TTAGCTGGAC AGACTTTATA CAAGGCTCTC TTATGTTTAT TGCAATATTC TTAATTCCTA TCATGGGAAT TGTTCACATG GGCGGAATTG ATGCTACAAT GAATGCATGG AATGTAATAA GTCCAGATTA CATAAATCCA TTTACCGACC TTGATGGAGA AGCTCTCGGT GTAATGGGGC TTGCATCAGC TCTTGCATGG GGTTTAGGGT ACTTTGGAAT GCCTCACATC CTTGTAAGAT TCATGGCAAT TAAATCAGCT GATAAAATTC CAAAAGCAAG AAAAATTGCA ACTACTTGGG TTGTAATCAG CCTTTTCATG GCAGTTCTTG TTGGAATGGT TGGTGCAGTG GCTCTTGGAG CTCCACTGGA CGATCCAGAG CACGTATTCA TGGCAATGGC AAAAGGATTA TTCCCAAGTC TTATTGCAGG TGTATTTTTA GCTGGGGTTC TAGCAGCTAT CATGAGTACT GCAGATTCAC AGCTTTTAGT TACTGCTTCA GCAATTACTG AAGATATTTA TGCATTATTA AACAAAAATG CAAGTCAAAA AGAGCTTTTA TGGATAAGCA GGTTTGCAGT AATTGCTGTG GCGGCAATAG CGTACTACTT TGCAATAGTT CCTGGAAGTA GCGTTATGGG ACTTGTTTCA TATGCGTGGG CAGGATTTGG TGGTGCATTT GGGCCTGTAA TCTTACTTTC ATTATACTGG AAGAGAATGA CAAGAAATGG TGCTCTTGCA GGTCTGCTTT CTGGTGGATT CATGGTAATT CTCTGGAAAA ACTTGAGCGG TGGAATATTT GATTTATACG AAATCGTTCC AGCATTTTTG CTCGCATCAA TAATGATTAC AGTTGTAAGT TTACTTGACA AAGAACCTTC ATTAGAAATT CAGGAAGAGT TCGACAGAGC AATCTCTGAA ATGAAGTAG
|
Protein sequence | MISDNLSIVL IFMLYLLVVM GVGMYFYRRN ETISDYVLGG RKLNSWVAAL SAQASDMSGW LLMGLPGVAY LSGMSEIWIG VGLAIGTYLN WKFVAERLRR YTEIAKDSIT IPVYLENRFR DQSKLLRIVS AFFIMLFFLL YTSSGLVAGG KLFNLVFGVD YTLAVTIGAL VIIGYTFLGG FLAVSWTDFI QGSLMFIAIF LIPIMGIVHM GGIDATMNAW NVISPDYINP FTDLDGEALG VMGLASALAW GLGYFGMPHI LVRFMAIKSA DKIPKARKIA TTWVVISLFM AVLVGMVGAV ALGAPLDDPE HVFMAMAKGL FPSLIAGVFL AGVLAAIMST ADSQLLVTAS AITEDIYALL NKNASQKELL WISRFAVIAV AAIAYYFAIV PGSSVMGLVS YAWAGFGGAF GPVILLSLYW KRMTRNGALA GLLSGGFMVI LWKNLSGGIF DLYEIVPAFL LASIMITVVS LLDKEPSLEI QEEFDRAISE MK
|
| |