Gene Mbur_1952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbur_1952 
Symbol 
ID3997853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanococcoides burtonii DSM 6242 
KingdomArchaea 
Replicon accessionNC_007955 
Strand
Start bp2049253 
End bp2051169 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content45% 
IMG OID637959694 
ProductNa+/solute symporter 
Protein accessionYP_566583 
Protein GI91773891 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGGAT ATCAATTGTT TATGGCAATG CTTGCAGTTT ACATATGTGG ACTCATATGC 
ATTGGTTGGT ACTTTACAAA AAGACAGCAA ACAGTAACTG ATTTCTGGCT GGCAGGACGT
AAGATCGGCA CAATAGGAAT AGGATTCTCC TCTGCCGCAT CATGGCTGAC CGCTGGTGGA
ATATTAGCAG TTATTGGTTT CTTTATGCTA CTTGGAATGG GCTCCATTTG GGGGTTTGTA
GCTCCTAACA TTCTGGCATT GCTTATAATC GCTATTTTTG TCAAGAAGTT CAAGAATCTA
CCTTCTATCA CACAGGCTGA ACTTCTCGAA CAGAGGTACA GTTCTGCAAT ACGTGCTCCT
GTAGGTATTA TTATCACTAT TGTCATGATC CTTTTTGCGG TAGCAGATAT CAAAGGATTC
GCTCTTGTGT TGCAGATATT CTATGGCGTG GATCCGATCT ATGCGGCACT TATCGTTGCT
CTCGCTGTTT CTGCGTATGT GACCCTCGGT GGTCTCAGTG CGGTAGTATG GACTGATGTC
GTGCAGTTCG TTTTCCTTTC TATCTTTGCT CTTGCAATGG CATTTCTGGC GATCGGGGCA
GCAACTTCAG GTTCCGCGGA TATTTCTTCT GCATCTGACC TGATATCAAA TGTATCCACA
GATTGGTGGA ATCCGTTCAT CATAGGCATT CCGATGGTCC TTATCTTTGT TTTCGCTATC
ATCCCCGGAT GGATAACAGA GCAGGATCCA TGGCAAAAAG TATGGGCGGC TAAAGACTCG
ACATCTGCAA GGAATGGTCT TGTACTTGGC TCTTTCCTTG TAACAGTAGT ATTCACTGCA
TGTGCGTTCA TTGCTATTGG ATTGAACTCA CTGTATCCTG AGATATCTGC GATGGGTTTC
CCTATGGGAA TGGGGCTGGC AGAACCTGCA CTTCTTACAT TCATTGTAGA GACATTCTCT
CCTGCTGTCA TCGGGCTCTG TGCGATTGGT CTTGCAGCAG CATCAATGTC CTGTGCGGAT
ACTTTTGCAA CCTCGGGGGC ATCATGTGTT TCACGTGACA TCTACCAGAG GTTTGTCAAA
CCGGACGCTA CAATGAAGCA GATGCTCACT ATCAACAGGC TGAGCGTTCT TTTTATTGTT
GCAGCAGCAA CGGTTTCTTC GTTCTTCATA AACGGTATCC TCGATGCAAT TCACATTGCC
ACTTTCATTG CAAGTGCATC CTATTTCTTC CCTCTCATGG GTGGACTTTA CTGGAAGCGC
GCAACAAAAG AAGGTGCATT AGCAGGGCTT ATTCTCGGTG GTGTGGCACA GATATCATTT
ACTGTGTATG ACCTTTTAAT GACCGCACCC ATGGCACCTC CATACCTTGA GACCGTTCAC
CCAATTCTCA TGAACCACGG TGTAATTGTA GGAATGGGAT TGAGTGGAAT TGCTTTCTTT
GGTGTATCCC TTCTGACAAA GCCATCGAAT GTCATCAATC TTGCTCCTTT CTTTAAGGAT
GTGGCAGAAG AATTGGCCAG TCATGATGCA CAGGAGGTCG ATGATCAGTC TTTAGAATAT
CAGAATTTCC TCAAGACACT CGATGAACAG ATCACCGGGG AACGTGCACA TCTTCACTTG
AGGCTCGAAA GCTCAGCTAC AGTGAACTGG CGCAAGTTCA TAGAACAGCT AAGGGAGGCT
TATCCTGTAT GGGTAACTCC GACTGGACTT GATTCAGTCT ACAGGCTCAT CCAGGCGGAC
ATGCTTGCCT GTGTCTCTAT CACACGTGGT GAGAATGAAA AGGAGATTTG GTTCGCATCA
GAACCACAAG TCGATTCTGT TGAAATGCAG AAGAAAGAGG TCTTTATTGC ATACAAAGAA
GTATCAAAGG CTCTCGAAAA TGTCGGTATT CTCCTGACAA TTCCAAGCGA AGACTAA
 
Protein sequence
MDGYQLFMAM LAVYICGLIC IGWYFTKRQQ TVTDFWLAGR KIGTIGIGFS SAASWLTAGG 
ILAVIGFFML LGMGSIWGFV APNILALLII AIFVKKFKNL PSITQAELLE QRYSSAIRAP
VGIIITIVMI LFAVADIKGF ALVLQIFYGV DPIYAALIVA LAVSAYVTLG GLSAVVWTDV
VQFVFLSIFA LAMAFLAIGA ATSGSADISS ASDLISNVST DWWNPFIIGI PMVLIFVFAI
IPGWITEQDP WQKVWAAKDS TSARNGLVLG SFLVTVVFTA CAFIAIGLNS LYPEISAMGF
PMGMGLAEPA LLTFIVETFS PAVIGLCAIG LAAASMSCAD TFATSGASCV SRDIYQRFVK
PDATMKQMLT INRLSVLFIV AAATVSSFFI NGILDAIHIA TFIASASYFF PLMGGLYWKR
ATKEGALAGL ILGGVAQISF TVYDLLMTAP MAPPYLETVH PILMNHGVIV GMGLSGIAFF
GVSLLTKPSN VINLAPFFKD VAEELASHDA QEVDDQSLEY QNFLKTLDEQ ITGERAHLHL
RLESSATVNW RKFIEQLREA YPVWVTPTGL DSVYRLIQAD MLACVSITRG ENEKEIWFAS
EPQVDSVEMQ KKEVFIAYKE VSKALENVGI LLTIPSED