Gene Mbur_0488 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbur_0488 
Symbol 
ID3998326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanococcoides burtonii DSM 6242 
KingdomArchaea 
Replicon accessionNC_007955 
Strand
Start bp482367 
End bp483497 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content44% 
IMG OID637958301 
Productglycine betaine/L-proline ABC transporter, ATPase subunit 
Protein accessionYP_565221 
Protein GI91772529 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1125] ABC-type proline/glycine betaine transport systems, ATPase components 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.00399049 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATCCA AAAGGTTATT CGACAGAATC GATTCGATAA AACTTCGCGG TGTGACGAAG 
AAATATGATG ACAGGTTTGC TATCAATGAT GTTTCTATTG ATATTGAAGG TGGGGAGCTG
GTCATTTTCA TAGGCCCCAG TGGGTCCGGG AAGACCACAA CACTGCGTAT GATCAATCGT
TTGATAGAAC CCGATTCAGG GACTATTCTC ATCAATGACC AGAATGTCAT GGAACTTGAG
CCGGTTGCCC TTCGCAGGAA CATAGGTTAT GTTATACAGA GTATCGGTCT TTTCCCTCAC
ATGACCATTG CCGAGAATAT TGGTCTCGTG GCCAAACTGG AAGGCTGGAA TGAGAAAAAG
ATCAAAGACA GGGTAGAATA CCTCCTTGAT TTTGTTTCCC TTCCGTCTGA GATGTTCATG
GATAGGTATC CTCATCAACT AAGCGGTGGA CAACAGCAAA GAGTTGGACT CGCAAGGGCA
CTTTTGATGG ACCCTCCCCT TTTGCTCATG GACGAACCCT TCGGTGCACT TGACCCGATC
TTAAGGAAAC AACTTCAGGA AGAGTTCTAC CAGATAAGGG AAAAACTGGG TAAGACCATA
ATATTCGTGA CACACGATAT CGAAGAAGCT TTCAAGCTCG GTGACAGGAT CGCAATAATG
GATAATGCGA AACTTGTTCA GATAGGCACA GCTGAAGAAT TGATATTTCA TCCCGCAAAC
GAAATGGTGG CAAGCATTGT AGATTCCGGT AAGAAGTTCA AGCACCTTGA TACGTTGAAA
ATAAAGGACC TCATATCCCC CCTTGAATGC ATGTATGTCC ACAATGGATC ACTTGACATC
GAGAGTGCTA TCAGTTCCAT GATAGAGAAG AACATCGAGA TCGCTGTGGT TTCTAATGGT
TCGGGTCCGC TGGGTATTGT AAAGCTTATT GATCTATTGC GTATGGATGA TAAGGACAGC
AAGATTGCAG ATCATGTTGT TGAGATCCCT TCATTTTCCA GAAATGAACT GCTCTCATCC
TCACTGAAAA TAATGCAGAA GAATGGTCAT TCGATGGCCT TTGTCATGAC CGATGAAGAA
CTAAGCGGAT TCCTGTTTCC AAATGATGCT TTCAGTCAGG TAATTGGATA A
 
Protein sequence
MPSKRLFDRI DSIKLRGVTK KYDDRFAIND VSIDIEGGEL VIFIGPSGSG KTTTLRMINR 
LIEPDSGTIL INDQNVMELE PVALRRNIGY VIQSIGLFPH MTIAENIGLV AKLEGWNEKK
IKDRVEYLLD FVSLPSEMFM DRYPHQLSGG QQQRVGLARA LLMDPPLLLM DEPFGALDPI
LRKQLQEEFY QIREKLGKTI IFVTHDIEEA FKLGDRIAIM DNAKLVQIGT AEELIFHPAN
EMVASIVDSG KKFKHLDTLK IKDLISPLEC MYVHNGSLDI ESAISSMIEK NIEIAVVSNG
SGPLGIVKLI DLLRMDDKDS KIADHVVEIP SFSRNELLSS SLKIMQKNGH SMAFVMTDEE
LSGFLFPNDA FSQVIG