Gene Mbar_A2978 details


Gene Information

Locus tag: Mbar_A2978
Symbol:
ID: 3626621
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 3832324
End bp: 3834009
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 36%
IMG OID: 637701823
Product: hypothetical protein
Protein accession: YP_306453
Protein GI: 73670438
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4690] Dipeptidase
TIGRFAM ID:
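
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are inclusive and that the gene length counts the stop codon, neither of which the record states explicitly. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3832324
END_BP = 3834009
GENE_LENGTH_BP = 1686
PROTEIN_LENGTH_AA = 561


def span_length(start, end):
    """Length of an interval whose start and end are both inclusive (assumption)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon is not translated


# Both comparisons hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3834009 - 3832324 + 1 gives 1686 bp, and 1686 / 3 - 1 gives 561 aa, matching the listed lengths.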


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0567036
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.419431
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGCACAA CAATAATCAT TGGTAAGAAA GCATCCAAAG ACGGTTCGGT AATCATTGCT 
CATTCTGACG ATTTCTTAGG GGATGCAAGA GTTATTAGTG TTCCTTCATT TAACCTGGAA
AATCGAAATG TGTATTATGA TAATGCCTCT TTTGGACTCA ATAAAGCATA TAATTCTACT
GAAATACGCA GGTATATAGG AAAAGACAGA GGAGAGGGGT ATGATACGAA AGATTATACT
TCAAGTAAAC CTCTGGGAGT TATACCTGGT TTTGGTAAAG ATACATATGC CTATTTTGAT
TATGAGTATG GAATAATTAA TGAAAAAGGC TTAATGGTTG GTGAATGTAC ATGTGGGGCT
AAAATACAGC CAGGGCCAGA CCCAAAAAAA AGGATTTTTT ATAGTTCGGA ACTTTCTAGA
GTAGCTTTAG AAAGATGCAC AAAAGCACGG GAAGCAGTGG AATTAATAGG TAAATTAATT
TTTGAATATG GTTATTATGG CACAGGTGAA ACCTTATCAC TTGGTGATGC CGACGAAGGC
TGGGTCATGG AAATGTGCGC CTATGAAGAG GATGGTAATT CTGGTATCTG GGTAGCGCAA
CGTGTACCTG ATGATGAGTT TTTTATTGCA GCCAATCAGT TTAGAATTAG GGATATTCAC
AAAAATAATG AAGACGATGG CAGTGATGAA AAGCTTTATT TCAATTCGTT AGATGAAAAT
GGAAACCTCA GAGACGATTT GTTATATTCT GCCAATCTTT TTAATGCTTG TCATAAAACA
AATTGGATAG CTAGTGATGA AAAATGTATA GATTGGGCAG CAACTGTCAG CTATGGTGAG
TATCTTCACC CCTATTATTC CTTACGTAGG GTATGGAGGG CTTTCTCTAA GGTTGCACCT
TTATCGAATC TGCCTTCGAG AGTAACTGAT GGATATACCA AAGATTATCC TTTTTCTTTA
AAACCAGAGA ACAAATTGTC AATATTGGAT GTCGCCAATG TGTTTAGAGA TTTCTATGAA
GGAACAGAAT TTGATCTAAC GATAGGGCCC GCTTCAGGTC CTTTTAGGAA TCCTATTAGA
TATCAAAATA ATCCTGATCA AGGAGATACT TATGATTTAA ACGTATACAA GCCTGAAGGA
GCGTGGGAGC GCCCACTGTC GAATCATCAA TGTGGCGTTT TGTGGATTAA TCAAGCTATA
AAAGCAAAGG GCAATACGGA AGCTGTTTGC TGGATTGGCT TAGATAGACC ATTTGCTAAT
TGTTTAATGC CCTTTTATTG TAAAATGGAT AAGCTACCTA AAGAATTACA AACTATGAAT
TTATTAGATT TTCAGTTTAA TGGTGACAGT GCATGGTGGG CATTTAATTT TGTGTCAAAT
TTTGTAAACT TGAATTTTCT TTATATGATG CGGGAAGTAA AAGCATTGCA AGAAAGGTTT
GAAACAAAAA CGGAAAAGGA TGTTATGGAA ATACTTTCCA ATGGAGATAT GGATAAATTT
GCCACCTATT GCACTGAAAA TTTTCAAGAA GTAGTGAAAC AATGGTGGAG TTTAGCCTCT
TACCTAATTA TCAAATACAG TAATGGTTGT ATAACCACTG CTCCGGATTC AACTATGAAA
AAAATTGATT ATCCAAAAAA TTGGCTAAAG GAAATTGGCT ATTTTGATGG CCCTGTTGGA
TATTGA
 
Protein sequence
MCTTIIIGKK ASKDGSVIIA HSDDFLGDAR VISVPSFNLE NRNVYYDNAS FGLNKAYNST 
EIRRYIGKDR GEGYDTKDYT SSKPLGVIPG FGKDTYAYFD YEYGIINEKG LMVGECTCGA
KIQPGPDPKK RIFYSSELSR VALERCTKAR EAVELIGKLI FEYGYYGTGE TLSLGDADEG
WVMEMCAYEE DGNSGIWVAQ RVPDDEFFIA ANQFRIRDIH KNNEDDGSDE KLYFNSLDEN
GNLRDDLLYS ANLFNACHKT NWIASDEKCI DWAATVSYGE YLHPYYSLRR VWRAFSKVAP
LSNLPSRVTD GYTKDYPFSL KPENKLSILD VANVFRDFYE GTEFDLTIGP ASGPFRNPIR
YQNNPDQGDT YDLNVYKPEG AWERPLSNHQ CGVLWINQAI KAKGNTEAVC WIGLDRPFAN
CLMPFYCKMD KLPKELQTMN LLDFQFNGDS AWWAFNFVSN FVNLNFLYMM REVKALQERF
ETKTEKDVME ILSNGDMDKF ATYCTENFQE VVKQWWSLAS YLIIKYSNGC ITTAPDSTMK
KIDYPKNWLK EIGYFDGPVG Y
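
The relationship between the gene and protein sequences, and the GC content field, can be reproduced with a short script. This is a minimal sketch, not the database's own method. It uses only the amino acid assignments of translation table 11, which are the same as those of the standard genetic code, and ignores alternative start codons. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; translation table 11
# uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGTGCACAA CAATAATCAT TGGTAAGAAA"))  # MCTTIIIGKK
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison against the protein sequence and the GC content field of this record.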