Gene Mbar_A2787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A2787 
Symbol 
ID3625115 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp3557682 
End bp3558866 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content43% 
IMG OID637701638 
Productputative ABC transport system permease protein 
Protein accessionYP_306268 
Protein GI73670253 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000544033 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000418311 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTTGACG AAGAATTTTC AATGGCAATA AGGCAGTTGC GTCTTAAAAA ACTCCGAAGC 
CTGTTGATCC TTCTAGGAAT CGCAATTGGA GTAGCATCTG TAGTAGCCGT CGTATCCTTA
GGAGAAGGTC TCCGAATCAA TGCGGTAGAA GAAATTCAGA AGTCTCGTGA CCTGACTTTG
ATTGATGTTT CTCCGGGTTT AAAAGACAAT GGACTCATCC TGATAAGCGA GTCGAAAGTA
GAAGAACTAA AAGGATATGG AAAGCTTGTC TGTCCGTATG TAAAGGATGC CTATGTTAGC
CCTTCAGGAA GCTATTTTGA CTTGTTTGGT GTTCAGGAAG ATTACAGAAC TGCAAATGAG
CTTGAGCTTG CCGGAGGGAG CTGGTTTGAG AGTGGACAGA ACCAGATTGT ACTTGGAAGT
GACCTGTGGG GAAAACTGGA AAAATTAGAT GGAGCCAAAA TAGGAACACC CATGACAGCT
AAGCTGCGGC TTTATGGAGA CGATGGAAGA CCAATCGATA AGGAAGTAAG CTTTATTCCA
ATGGGCTACC TAAAGCCAAC CGGAGGCGAA ATCGACAATG GAGCCTTTAT GAGGCTTGAA
TATGCAAAAA AACTCAGCAA AAAGGAGTAT TATGACGGAG CTCTGGTTAA GGTGGAAAGT
TCCTCTTTTG TTGCCGATAC AAGAAATCAG ATCGAAAAAC TTGGGCTTGC TACTTCAAGT
GCCCAGGATG AAATCGATTC AGTTAACCGG ATAATGAATG GGATAACTCT TGTCCTTGCC
TTTTTTTCGA GTATTACCTT GCTTGTAGGT GGACTAATGG TAATCAACAC TATGGTGGTA
TCGGTCTACG AACGAACCAG GGAGATAGGG ATCTCAAAAG CTCTTGGTGC TTCAGAATCT
GACATCCTGC GAATGTTTTT AGCTGAATGT CTGTTTATTG GGGCACTTGG AGGGATTTTT
GGAGATTTTT TTGGTATCAT ATTTTCAACG CTCATAGATA GAGTAGGAAG GCCCCTGTTG
GTATCACGTC TCGGAATTGA AAATATTGGC CATCTGACTG CATTAAACTT CGAGATTCTT
GCAGCCGGTT TTATAATCTC CTTATTTGTG TCCGTGCTTT CAGGACTTTA TCCTGCCTGG
AGGGCTGCAA AATTGGATCC TATTAAGGCA CTGAGACATC TTTGA
 
Protein sequence
MFDEEFSMAI RQLRLKKLRS LLILLGIAIG VASVVAVVSL GEGLRINAVE EIQKSRDLTL 
IDVSPGLKDN GLILISESKV EELKGYGKLV CPYVKDAYVS PSGSYFDLFG VQEDYRTANE
LELAGGSWFE SGQNQIVLGS DLWGKLEKLD GAKIGTPMTA KLRLYGDDGR PIDKEVSFIP
MGYLKPTGGE IDNGAFMRLE YAKKLSKKEY YDGALVKVES SSFVADTRNQ IEKLGLATSS
AQDEIDSVNR IMNGITLVLA FFSSITLLVG GLMVINTMVV SVYERTREIG ISKALGASES
DILRMFLAEC LFIGALGGIF GDFFGIIFST LIDRVGRPLL VSRLGIENIG HLTALNFEIL
AAGFIISLFV SVLSGLYPAW RAAKLDPIKA LRHL