Gene MmarC5_1465 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmarC5_1465 
Symbol 
ID4927613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanococcus maripaludis C5 
KingdomArchaea 
Replicon accessionNC_009135 
Strand
Start bp1401778 
End bp1403256 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content39% 
IMG OID640166960 
Productsodium/proline symporter 
Protein accessionYP_001097976 
Protein GI134046491 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR02121] sodium/proline symporter 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTATCTG AAAATTTGAG TATCGTTTTG ATCTTCATGC TCTATTTGCT CGTGGTAATG 
GGCGTGGGTA TGTATTTCTA CAGGCGAAAC GAAACTATAA GCGATTATGT GCTTGGTGGT
AGAAAATTAA ATAGCTGGGT TGCGGCATTA AGTGCGCAAG CTTCAGACAT GAGCGGTTGG
CTTTTAATGG GTCTTCCGGG AGTTGCATAT CTTTCTGGAA TGAGTGAAAT ATGGATAGGA
GTTGGTCTTG CAATAGGAAC TTACCTAAAC TGGAAGTTCG TTGCAGAAAG GCTTAGAAGA
TACACAGAAA TTGCAAAAGA TTCTATTACA ATACCTGTTT ACTTGGAAAA CAGGTTTAGG
GATCAGTCTA AAATGTTAAG AATTGTTTCA GCGTTTTTTA TTATGCTATT TTTCTTATTG
TACACGTCTT CAGGATTAGT TGCGGGCGGA AAATTGTTCA ATCTTGTATT TGGAGTAGAT
TATACTCTCG CAGTTACAAT AGGTGCTTTA GTAATTATTG GTTATACATT CCTCGGCGGT
TTCCTTGCAG TTAGCTGGAC TGACTTTATA CAAGGTTCCC TCATGTTTAT TGCAATATTC
TTAATTCCAA TCATGGGTAT TGTCCACATG GGCGGAATTG ACGCTACAAT GAATGCTTGG
AATTCAATAA GTCCAGATTA CATAAATCCA TTTACAAATC TCGATGGAGA AGCTCTTGGT
GCAATGGGGC TTGCATCAGC TCTTGCATGG GGTCTTGGAT ACTTTGGAAT GCCACACATC
CTTGTAAGGT TTATGGCAAT TCAATCAGCT GATAAAGTTC CAAAAGCAAG AAGAATTGCG
ACTACCTGGG TTGTAATCAG TCTTTTCATG GCAGTTCTTG TTGGAATGAT TGGTGCAGTA
GCTCTTGGAG CACCGCTTGA TGATCCAGAG CATGTATTCA TGGCAATGGC ACAAGGATTA
TTCCCAAGTC TTATTGCAGG GGTATTTTTG GCAGGTGTTT TAGCAGCTAT CATGAGTACT
GCAGATTCAC AGCTTTTAGT TACTGCTTCG GCAGTTACTG AAGATATTTA TGCATTATTA
AATAAAAATG CAAGTCAAAA AGAGCTTTTA TGGATAAGCA GGTTTGCAGT AATTGCTGTG
GCGGCAATAG CTTACTACTT TGCAATAGTT CCTGGAAGCA GCGTTATGGG GCTTGTTTCA
TACGCATGGG CAGGATTTGG TGGTGCATTT GGTCCAGTGA TATTGCTTTC ATTATACTGG
AAGAGAATGA CTAGAAATGG TGCTCTTGCA GGTTTACTTT CCGGCGGATT TATGGTAATT
CTCTGGAAAA ACTTGAGCGG TGGAATATTT GATTTATACG AAATCGTTCC AGCATTTTTG
CTCGCAACAA TAATGATTAT AGTTGTAAGT TTAATTGATA AAGAACCTTC ATTAGAAATT
CAGGAAGAGT TCGACAGAGC AGTTTCCGAA ATGAAATAG
 
Protein sequence
MVSENLSIVL IFMLYLLVVM GVGMYFYRRN ETISDYVLGG RKLNSWVAAL SAQASDMSGW 
LLMGLPGVAY LSGMSEIWIG VGLAIGTYLN WKFVAERLRR YTEIAKDSIT IPVYLENRFR
DQSKMLRIVS AFFIMLFFLL YTSSGLVAGG KLFNLVFGVD YTLAVTIGAL VIIGYTFLGG
FLAVSWTDFI QGSLMFIAIF LIPIMGIVHM GGIDATMNAW NSISPDYINP FTNLDGEALG
AMGLASALAW GLGYFGMPHI LVRFMAIQSA DKVPKARRIA TTWVVISLFM AVLVGMIGAV
ALGAPLDDPE HVFMAMAQGL FPSLIAGVFL AGVLAAIMST ADSQLLVTAS AVTEDIYALL
NKNASQKELL WISRFAVIAV AAIAYYFAIV PGSSVMGLVS YAWAGFGGAF GPVILLSLYW
KRMTRNGALA GLLSGGFMVI LWKNLSGGIF DLYEIVPAFL LATIMIIVVS LIDKEPSLEI
QEEFDRAVSE MK