Gene MmarC7_1221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmarC7_1221 
Symbol 
ID5328369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanococcus maripaludis C7 
KingdomArchaea 
Replicon accessionNC_009637 
Strand
Start bp1195987 
End bp1197465 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content39% 
IMG OID640793774 
Productsodium/proline symporter 
Protein accessionYP_001330435 
Protein GI150403141 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR02121] sodium/proline symporter 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATATCAG ATAATTTGAG TATCGTTTTG ATATTCATGC TCTATTTGCT CGTGGTAATG 
GGCGTAGGTA TGTATTTCTA CAGGCGAAAC GAAACGATAA GCGATTATGT GCTTGGTGGC
AGAAAATTAA ATAGCTGGGT TGCAGCGTTA AGTGCACAAG CCTCAGACAT GAGCGGTTGG
CTTTTAATGG GTCTTCCAGG AGTTGCATAT CTTTCTGGAA TGAGTGAAAT ATGGATTGGC
GTAGGTCTTG CAATAGGAAC TTACCTAAAC TGGAAGTTCG TTGCAGAACG GCTTAGAAGA
TACACAGAAA TTGCAAAAGA TTCTATTACA ATACCTGTTT ACTTGGAAAA CAGATTTAGA
GATCAGTCTA AATTATTAAG AATTGTTTCA GCGTTTTTTA TTATGCTATT TTTCTTACTG
TACACGTCTT CAGGATTAGT CGCAGGCGGA AAGTTGTTCA ACTTAGTTTT TGGAGTAGAT
TATACTCTTG CAGTTACTAT CGGGGCGTTA GTAATTATTG GTTATACATT CCTTGGTGGT
TTCCTTGCAG TTAGCTGGAC AGACTTTATA CAAGGCTCTC TTATGTTTAT TGCAATATTC
TTAATTCCTA TCATGGGAAT TGTTCACATG GGCGGAATTG ATGCTACAAT GAATGCATGG
AATGTAATAA GTCCAGATTA CATAAATCCA TTTACCGACC TTGATGGAGA AGCTCTCGGT
GTAATGGGGC TTGCATCAGC TCTTGCATGG GGTTTAGGGT ACTTTGGAAT GCCTCACATC
CTTGTAAGAT TCATGGCAAT TAAATCAGCT GATAAAATTC CAAAAGCAAG AAAAATTGCA
ACTACTTGGG TTGTAATCAG CCTTTTCATG GCAGTTCTTG TTGGAATGGT TGGTGCAGTG
GCTCTTGGAG CTCCACTGGA CGATCCAGAG CACGTATTCA TGGCAATGGC AAAAGGATTA
TTCCCAAGTC TTATTGCAGG TGTATTTTTA GCTGGGGTTC TAGCAGCTAT CATGAGTACT
GCAGATTCAC AGCTTTTAGT TACTGCTTCA GCAATTACTG AAGATATTTA TGCATTATTA
AACAAAAATG CAAGTCAAAA AGAGCTTTTA TGGATAAGCA GGTTTGCAGT AATTGCTGTG
GCGGCAATAG CGTACTACTT TGCAATAGTT CCTGGAAGTA GCGTTATGGG ACTTGTTTCA
TATGCGTGGG CAGGATTTGG TGGTGCATTT GGGCCTGTAA TCTTACTTTC ATTATACTGG
AAGAGAATGA CAAGAAATGG TGCTCTTGCA GGTCTGCTTT CTGGTGGATT CATGGTAATT
CTCTGGAAAA ACTTGAGCGG TGGAATATTT GATTTATACG AAATCGTTCC AGCATTTTTG
CTCGCATCAA TAATGATTAC AGTTGTAAGT TTACTTGACA AAGAACCTTC ATTAGAAATT
CAGGAAGAGT TCGACAGAGC AATCTCTGAA ATGAAGTAG
 
Protein sequence
MISDNLSIVL IFMLYLLVVM GVGMYFYRRN ETISDYVLGG RKLNSWVAAL SAQASDMSGW 
LLMGLPGVAY LSGMSEIWIG VGLAIGTYLN WKFVAERLRR YTEIAKDSIT IPVYLENRFR
DQSKLLRIVS AFFIMLFFLL YTSSGLVAGG KLFNLVFGVD YTLAVTIGAL VIIGYTFLGG
FLAVSWTDFI QGSLMFIAIF LIPIMGIVHM GGIDATMNAW NVISPDYINP FTDLDGEALG
VMGLASALAW GLGYFGMPHI LVRFMAIKSA DKIPKARKIA TTWVVISLFM AVLVGMVGAV
ALGAPLDDPE HVFMAMAKGL FPSLIAGVFL AGVLAAIMST ADSQLLVTAS AITEDIYALL
NKNASQKELL WISRFAVIAV AAIAYYFAIV PGSSVMGLVS YAWAGFGGAF GPVILLSLYW
KRMTRNGALA GLLSGGFMVI LWKNLSGGIF DLYEIVPAFL LASIMITVVS LLDKEPSLEI
QEEFDRAISE MK