Gene Mbar_A2771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A2771 
Symbol 
ID3625099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp3520722 
End bp3521867 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content47% 
IMG OID637701623 
ProductL-sorbosone dehydrogenase 
Protein accessionYP_306253 
Protein GI73670238 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.069543 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0541194 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGGAA CAAGAGTAAT ACTTGGTTTT TTTGCAATCA TTGCACTTAT CTTTGTAGCT 
TATGCGGTTC TCAGAACCAT ACCACTTAAT GATTGCGGAG ATAACGGGAC AGGACTTAAC
TGTATCGAGC TTCCACCCGG TTTTTCCATT GATTATTATG CTGAGAATAT TGAGGGTGCA
AGGTCAATGG CGCTTAGCCC TAATGGCACT ATCTTTGTCG GAAGCCGGGA TACAGGGAAA
GTTTATGCAG TTCTTGACCG GAACAATGAT AGTAAAGCCG ACGAGGTTCT TGTGCTTGCT
GAGGGCCTAG ACATGCCAAA CGGGGTGGCG TTCAGGAACG GTTCGCTATA TGTCGCTGAG
GTCTCCAGGG TGATCCGCTA TGATGACATC GAAGCAAGGC TTGAAAATCC GCCTGAACCT
ATTGTAGTGA ACGATAATTT CCCGTCCGAT CGCTCACACG GCTGGAAATA TATCAAATTC
GGGCCTGACG GAAAGCTGTA TGTGCCTGTA GGAATGCCGT GCAATGTTTG TAATAAGGAG
GGAGAGGACG AAAGGTACGG AACAATCATG AGAATGGAAC CTGATGGAAG CCAGCTTGAG
ATTTTTGCAA AGGGTGTCAG GAACAGTGTC GGGTTTGACT GGAATCCCAG GACCGGAGAA
TTATGGTTTA CTGACAACGG AAGGGACTGG CTTGGAGATG ATGAGCCTCC GGATGAACTT
AATAGGGCGC CTGTAAAGGG AATGCATTTC GGTTTTCCTT ATTGCCACGG TGGAGATATT
CCAGATCCTG AATACGGGAA ACTCAGGAAC TGCTCAGAAT TCACACCTCC TGAAATGAAG
CTGGGGCCTC ATGTGGCTGC CCTTGGAATG ACTTTTTACA CAGGTACAAT GTTTCCTGAA
GAGTACAGAA ACCAGATCTT CATTGCAGAA CACGGCTCCT GGAACAGAAA AATTCCAATT
GGATACCGGG TTTCTCTTGT CAGGCTGGAG AACGGAAAGC CTGTAAGTTA CGAACCTTTT
GCTAATGGCT GGCTTCAGGG ACTTGCGGCC TGGGGAAGGC CTGTGGATGT TCTTGTGATG
CCTGATGGGG CACTGCTTGT TTCGGATGAC AAAAATAATG CAATCTACAG GATCAGCTAC
AGCTGA
 
Protein sequence
MKGTRVILGF FAIIALIFVA YAVLRTIPLN DCGDNGTGLN CIELPPGFSI DYYAENIEGA 
RSMALSPNGT IFVGSRDTGK VYAVLDRNND SKADEVLVLA EGLDMPNGVA FRNGSLYVAE
VSRVIRYDDI EARLENPPEP IVVNDNFPSD RSHGWKYIKF GPDGKLYVPV GMPCNVCNKE
GEDERYGTIM RMEPDGSQLE IFAKGVRNSV GFDWNPRTGE LWFTDNGRDW LGDDEPPDEL
NRAPVKGMHF GFPYCHGGDI PDPEYGKLRN CSEFTPPEMK LGPHVAALGM TFYTGTMFPE
EYRNQIFIAE HGSWNRKIPI GYRVSLVRLE NGKPVSYEPF ANGWLQGLAA WGRPVDVLVM
PDGALLVSDD KNNAIYRISY S