Gene Mbur_1205 details


Gene Information

Locus tag: Mbur_1205
Symbol:
ID: 3998361
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanococcoides burtonii DSM 6242
Kingdom: Archaea
Replicon accession: NC_007955
Strand:
Start bp: 1294806
End bp: 1296026
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 47%
IMG OID: 637958972
Product: peptidase U32
Protein accession: YP_565878
Protein GI: 91773186
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
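
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the stop codon counts toward the gene length but not the protein length. The function names are illustrative and do not come from any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(1294806, 1296026)
print(length)                  # 1221
print(protein_length(length))  # 406
```

Both values match the Gene Length and Protein Length fields of this record.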


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACTGCT CTGGATCATC TGTAAAGATC CCTGAACTTA TGATGGGTGT CAAGAATCGT 
GCATCTCTGG CTGCGTGCAG GGACTATGCA GACGGGGTCT ACTTCTCCAT CGACAGATTC
AGTCTCAGAG CAAGAGCATG TGATATTACA CTTGATGGTC TCAACGATTT TGTAGGGGAT
ATCAAAGAGA ATGATCTCAA TGCTTATCTT GCGCTGAATA CGGTGATCTA TCCCGATGAC
CTTGATGATC TTGATGTTGT TATCGATTCA GTCGCATCCT CGGATGTTGA TGCTGTCATT
GCATGGGATC CGGCAGTAAT AACAAAAGCA GTAGATGCCG GACTGAGGGT TCATATATCT
ACTCAGGCGA ATGTATCGAA CTGGCAGACA GTAGAATTCT ATGGTTCTTT GGGGGCTTCC
CGGGTCGTTC TGGCAAGGGA ACTCAGTATG GAGAATATAA AGGAGATCCG CTGGAACACT
GATGTTGAGC TTGAGGTTTT CATTCATGGT GCCATGTGTC AGGCTATCTC TGGCAGGTGC
TACCTTTCCG CTTATATTCT GGGCAAGTCC GGTAATTGTG GTGAGTGTTC CCAACCGTGC
CGATGGGGTT GGAAACTTGT TGGTGAGGAT GGCAGCGAGG TCGACCTTGA AGGGAAATAT
CTGTTAAGTG CGAGGGACCT GTGCATGATC GAACACATCC CTGAGCTGAT AGATACTGGT
GTGAATGCAT TCAAGGTCGA AGGCAGGTTG AAGGATGCTA GATACACATC TATTGTATCC
CGCTGTTATC GTGAAGCCCT CAATTCCTAT TGTGATGGTT CATATACACT TGAAAAAGCG
AGGTCATGGA AAGACGAGCT GGCTTCGGTA TTTAATCGTG GCTTTTCCAC GGGGTTCTAC
TTCGGGGTAC CCGGTCCGGA TGGTATTTCC ATCGAATCTG ATATGAACGT ATCAACTACC
AAAAGACACG CTGTAGGGGT GGTTACCAAT TATTACAGGA AGAGCGGGGC GGCGGAAGTA
AAGCTTCTCG AAACGGGTAT CGCTGTTGGA GACCACATTA TCATCGAAGG CAAAAGTACA
TACTTTGAAC AGGATATCAC TGAAATAAGG TCGGATGAAG GTCCTGTCCT ATCTGCATCC
TCAGGGGATA TCGTGGGAAT TGCTGTCAAG GATAAGGTGC GTGAAAATGA CAGGGTATAC
AGGTTGGAGA TCCCGGACTG A
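
The gene sequence is printed in blocks of ten bases, so whitespace has to be removed before computing anything from it. The following is a minimal sketch of doing that and of computing GC content as the share of G and C bases. The helper names are hypothetical, and only the first row of the listing is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence listing, used here only as sample input.
first_row = "ATGAACTGCT CTGGATCATC TGTAAAGATC CCTGAACTTA TGATGGGTGT CAAGAATCGT"
seq = clean_sequence(first_row)
print(len(seq))         # 60
print(gc_percent(seq))  # GC share of this row only, not of the whole gene
```

Running `gc_percent` on the full cleaned sequence gives a figure that can be compared with the GC content field of this record.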
 
Protein sequence
MNCSGSSVKI PELMMGVKNR ASLAACRDYA DGVYFSIDRF SLRARACDIT LDGLNDFVGD 
IKENDLNAYL ALNTVIYPDD LDDLDVVIDS VASSDVDAVI AWDPAVITKA VDAGLRVHIS
TQANVSNWQT VEFYGSLGAS RVVLARELSM ENIKEIRWNT DVELEVFIHG AMCQAISGRC
YLSAYILGKS GNCGECSQPC RWGWKLVGED GSEVDLEGKY LLSARDLCMI EHIPELIDTG
VNAFKVEGRL KDARYTSIVS RCYREALNSY CDGSYTLEKA RSWKDELASV FNRGFSTGFY
FGVPGPDGIS IESDMNVSTT KRHAVGVVTN YYRKSGAAEV KLLETGIAVG DHIIIEGKST
YFEQDITEIR SDEGPVLSAS SGDIVGIAVK DKVRENDRVY RLEIPD
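
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. Below is a minimal sketch using the standard codon assignments. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may initiate translation, so this sketch does not model alternative start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a DNA sequence (whitespace allowed) up to the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAACTGCT CTGGATCATC TGTAAAGATC"))  # MNCSGSSVKI
```

The output matches the first ten residues of the protein sequence above. Applied to the last 21 bases of the gene, the function returns `RLEIPD`, which matches the end of the protein.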