Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbur_1785 |
Symbol | |
ID | 3997766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcoides burtonii DSM 6242 |
Kingdom | Archaea |
Replicon accession | NC_007955 |
Strand | + |
Start bp | 1885246 |
End bp | 1886661 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637959534 |
Product | sodium/glutamate symporter |
Protein accession | YP_566423 |
Protein GI | 91773731 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0786] Na+/glutamate symporter |
TIGRFAM ID | [TIGR00210] sodium--glutamate symport carrier (gltS) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGCTG CAATGATCGG AATGAGCTTT CTCGTACTTG GTGTCATCCT GCTTCTTGGG AAATGGATAA GGGTTATGTC ACCCCCCATC CAAAAACTGT TCATTCCCAG TTCTCTTATC GGTGGATTCC TGGCTTTATT TCTTGGACCT GAGGTCCTGG GTTATTTGAT CTCATGGATA GGAGATAGCA GTACGTTCCT ATCAGGTGGG ATATTTCCAG AAGAGATGCT TGATGCGTGG ACCACTCTGC CTGGGCTTTT CATAAACATC ATATTTGCAA CACTTTTTCT TGGGAAAAAA CTACCCGCGA TCAAAGACAT ATGGCTTCTT GCAGGGCCAC AAATTGCCCA CGGACAGACC ATAGCATGGG GTCAGTATGT GTTCGGCATA TTGGCTGCAA TCCTGATATT GACCCCTTTT TTTGGAATGG ACCCAATGGC AGGTGCCCTT CTAGAAATAT CATTTGAAGG CGGGCATGGT ACTGCAGCTG GTATGAGCGC TACTTTTGAG GAACTTGGCT TTTCAGATGC AACAGACCTG GCACTTGGGC TTGCAACTGT GGGTATTCTC TTTGGTGTTA TTCTTGGGAT CGTGCTTTTG AACTATGGTG TAAGGTCAGG AAAAACAAGT GTGCTGAAAG ACCAATCCCA GTTATCCCTT AGTGAAACCT ATCAGAAAGG CATCATTGAT TTTGACGCAA GGGAGTCTGC GGGGAAGATA ACTACAAGAC CCGAATCCAT CGAGCCGCTT TCACTTCATT TTGCTTATGT GGGAGTTGCT ATTGGGATCG GGTATTTGAT CCTTCAGGCA CTGATATGGA TAGAGGCGAT CACTTGGGGC CAGGCAACGG GAATATATCT GCTTGCCCAT TTACCTCTTT TTCCTCTGGC AATGATAGGA GGCATCATAC TACAGATGTT CCTTGATAAG TTCGACCCGT ACTATACTCT GGACAGAGAT CTTATGATGA GGATACAGGG TTTATCTCTT GACATTCTGA TAACCAGTGC AATAGCCACA CTGTCACTAA CGGTTATCGG AAACAATCTG ATGCCTTTCG TCATACTTGC TACAGTTGGA ATTGTATGGA ACCTGATAGC GTTCTTATAC CTTGGACCAA AAATGATGCC TTCATACTGG TTCGAAAAGA GCATAGGTAA TTTTGGACAG TCCATGGGGA TGACGGCCAG TGGCCTGTTG TTAATGAGGA TAGCCGATCC TGCTTCAAAA TCTCCTGCCC TTGAGGGATT TGGTTATAAA CAGCTGCTAT TTGAACCGAT AGTGGGTGGT GGGATATTTA CAGCTGCCTC TGTACCCCTG ATATTCTATT TTGGCCCGAT GCCGATACTT ATCATGACAT CGGTTATCAT GGTATTCTGG GCAGGACTAG GGGTCTTTTA CTTTGGCCGA AAATAA
|
Protein sequence | MSAAMIGMSF LVLGVILLLG KWIRVMSPPI QKLFIPSSLI GGFLALFLGP EVLGYLISWI GDSSTFLSGG IFPEEMLDAW TTLPGLFINI IFATLFLGKK LPAIKDIWLL AGPQIAHGQT IAWGQYVFGI LAAILILTPF FGMDPMAGAL LEISFEGGHG TAAGMSATFE ELGFSDATDL ALGLATVGIL FGVILGIVLL NYGVRSGKTS VLKDQSQLSL SETYQKGIID FDARESAGKI TTRPESIEPL SLHFAYVGVA IGIGYLILQA LIWIEAITWG QATGIYLLAH LPLFPLAMIG GIILQMFLDK FDPYYTLDRD LMMRIQGLSL DILITSAIAT LSLTVIGNNL MPFVILATVG IVWNLIAFLY LGPKMMPSYW FEKSIGNFGQ SMGMTASGLL LMRIADPASK SPALEGFGYK QLLFEPIVGG GIFTAASVPL IFYFGPMPIL IMTSVIMVFW AGLGVFYFGR K
|
| |