Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbur_1952 |
Symbol | |
ID | 3997853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcoides burtonii DSM 6242 |
Kingdom | Archaea |
Replicon accession | NC_007955 |
Strand | + |
Start bp | 2049253 |
End bp | 2051169 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637959694 |
Product | Na+/solute symporter |
Protein accession | YP_566583 |
Protein GI | 91773891 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGGAT ATCAATTGTT TATGGCAATG CTTGCAGTTT ACATATGTGG ACTCATATGC ATTGGTTGGT ACTTTACAAA AAGACAGCAA ACAGTAACTG ATTTCTGGCT GGCAGGACGT AAGATCGGCA CAATAGGAAT AGGATTCTCC TCTGCCGCAT CATGGCTGAC CGCTGGTGGA ATATTAGCAG TTATTGGTTT CTTTATGCTA CTTGGAATGG GCTCCATTTG GGGGTTTGTA GCTCCTAACA TTCTGGCATT GCTTATAATC GCTATTTTTG TCAAGAAGTT CAAGAATCTA CCTTCTATCA CACAGGCTGA ACTTCTCGAA CAGAGGTACA GTTCTGCAAT ACGTGCTCCT GTAGGTATTA TTATCACTAT TGTCATGATC CTTTTTGCGG TAGCAGATAT CAAAGGATTC GCTCTTGTGT TGCAGATATT CTATGGCGTG GATCCGATCT ATGCGGCACT TATCGTTGCT CTCGCTGTTT CTGCGTATGT GACCCTCGGT GGTCTCAGTG CGGTAGTATG GACTGATGTC GTGCAGTTCG TTTTCCTTTC TATCTTTGCT CTTGCAATGG CATTTCTGGC GATCGGGGCA GCAACTTCAG GTTCCGCGGA TATTTCTTCT GCATCTGACC TGATATCAAA TGTATCCACA GATTGGTGGA ATCCGTTCAT CATAGGCATT CCGATGGTCC TTATCTTTGT TTTCGCTATC ATCCCCGGAT GGATAACAGA GCAGGATCCA TGGCAAAAAG TATGGGCGGC TAAAGACTCG ACATCTGCAA GGAATGGTCT TGTACTTGGC TCTTTCCTTG TAACAGTAGT ATTCACTGCA TGTGCGTTCA TTGCTATTGG ATTGAACTCA CTGTATCCTG AGATATCTGC GATGGGTTTC CCTATGGGAA TGGGGCTGGC AGAACCTGCA CTTCTTACAT TCATTGTAGA GACATTCTCT CCTGCTGTCA TCGGGCTCTG TGCGATTGGT CTTGCAGCAG CATCAATGTC CTGTGCGGAT ACTTTTGCAA CCTCGGGGGC ATCATGTGTT TCACGTGACA TCTACCAGAG GTTTGTCAAA CCGGACGCTA CAATGAAGCA GATGCTCACT ATCAACAGGC TGAGCGTTCT TTTTATTGTT GCAGCAGCAA CGGTTTCTTC GTTCTTCATA AACGGTATCC TCGATGCAAT TCACATTGCC ACTTTCATTG CAAGTGCATC CTATTTCTTC CCTCTCATGG GTGGACTTTA CTGGAAGCGC GCAACAAAAG AAGGTGCATT AGCAGGGCTT ATTCTCGGTG GTGTGGCACA GATATCATTT ACTGTGTATG ACCTTTTAAT GACCGCACCC ATGGCACCTC CATACCTTGA GACCGTTCAC CCAATTCTCA TGAACCACGG TGTAATTGTA GGAATGGGAT TGAGTGGAAT TGCTTTCTTT GGTGTATCCC TTCTGACAAA GCCATCGAAT GTCATCAATC TTGCTCCTTT CTTTAAGGAT GTGGCAGAAG AATTGGCCAG TCATGATGCA CAGGAGGTCG ATGATCAGTC TTTAGAATAT CAGAATTTCC TCAAGACACT CGATGAACAG ATCACCGGGG AACGTGCACA TCTTCACTTG AGGCTCGAAA GCTCAGCTAC AGTGAACTGG CGCAAGTTCA TAGAACAGCT AAGGGAGGCT TATCCTGTAT GGGTAACTCC GACTGGACTT GATTCAGTCT ACAGGCTCAT CCAGGCGGAC ATGCTTGCCT GTGTCTCTAT CACACGTGGT GAGAATGAAA AGGAGATTTG GTTCGCATCA GAACCACAAG TCGATTCTGT TGAAATGCAG AAGAAAGAGG TCTTTATTGC ATACAAAGAA GTATCAAAGG CTCTCGAAAA TGTCGGTATT CTCCTGACAA TTCCAAGCGA AGACTAA
|
Protein sequence | MDGYQLFMAM LAVYICGLIC IGWYFTKRQQ TVTDFWLAGR KIGTIGIGFS SAASWLTAGG ILAVIGFFML LGMGSIWGFV APNILALLII AIFVKKFKNL PSITQAELLE QRYSSAIRAP VGIIITIVMI LFAVADIKGF ALVLQIFYGV DPIYAALIVA LAVSAYVTLG GLSAVVWTDV VQFVFLSIFA LAMAFLAIGA ATSGSADISS ASDLISNVST DWWNPFIIGI PMVLIFVFAI IPGWITEQDP WQKVWAAKDS TSARNGLVLG SFLVTVVFTA CAFIAIGLNS LYPEISAMGF PMGMGLAEPA LLTFIVETFS PAVIGLCAIG LAAASMSCAD TFATSGASCV SRDIYQRFVK PDATMKQMLT INRLSVLFIV AAATVSSFFI NGILDAIHIA TFIASASYFF PLMGGLYWKR ATKEGALAGL ILGGVAQISF TVYDLLMTAP MAPPYLETVH PILMNHGVIV GMGLSGIAFF GVSLLTKPSN VINLAPFFKD VAEELASHDA QEVDDQSLEY QNFLKTLDEQ ITGERAHLHL RLESSATVNW RKFIEQLREA YPVWVTPTGL DSVYRLIQAD MLACVSITRG ENEKEIWFAS EPQVDSVEMQ KKEVFIAYKE VSKALENVGI LLTIPSED
|
| |