Gene Mbar_A2029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A2029 
Symbol 
ID3627846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp2563280 
End bp2564347 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content42% 
IMG OID637700907 
ProductABC transporter, ATP-binding protein 
Protein accessionYP_305543 
Protein GI73669528 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0722131 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.56816 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTATTC CTTCCTTCTC CACTCAATTT CTTTCAGTAG AAAACCTGAG CGTCTCTTTC 
AAAACCTCAA AGGGCCTTGT AAAAGCAAAC GAGAACATTT CCTTTGAGAT CAAAGAAGGA
GAAATTTTTG GGCTTATTGG AGAAACAGGC TGTGGAAAAA CTACTCTTGG AAAAGCCCTT
TTGAGGCTGC TCTCAAATAA TGCAAGGATA GAAGGGAGAA TCGTTTACAG GGGTAAGAAT
ATATTAAGTC TTTCTGAAAA AGAAATGAGG AGCCTGAGAG GAAAGGAAAT CGGGATCATG
CTACAGGATC CTTCAGTCTG TTTTAACCCA GTGCTCTCTA TAGGAAGTCA AATTGCTGAA
ATTTATCGAT ACCATGAAGG CATGAGAAAA AAAGATGCAA AAAAGAAAGC GTCAGAGATG
CTTGAGCTTG TTGGAATAGA TTCCTCAAGA AAGTCTGAAT ACCCTCACCA GTTCAGTGGT
GGGATGCTGC AGAGAGTTAT GATAGCAGTA GCGCTTGCTC TCAAGCCAAG ACTTCTTATT
GCAGACGAAC CTACAAAGGG GCTTGATCCT GATATGAAAT TGCAAATTCT AGAAATTATT
ACCAAGCTTG TCCGGAAGGA AAATTCTTCC ATGCTCTTAA TTACTCATGA TCTGGATGTA
GCCACTAAAC TTACAGATAG GACTGCAGTG ATGTATGCAG GAGAAATCGT TGAAATCGGG
AAGACAGCAA CGGTCATTTC CGATCCAAAA CACCCTTATA CTTTTGCACT ACTGCATTCT
CTTCCAGAAA AAGGGCTAAT GACTGTTTTA GGTCAGTCTC CAAGCCTGAT CTCCCCTCCT
TCCGGTTGCA GGTACCATCC CCGATGTAGT AACCAGCTGG CAGACTGCTC AAAGATCCAC
CCTGAACTAT TGGAACATGT AGATGATCAT TTTGTCCGCT GCTTATTATC TGAAAAAAAC
AGTAAGAATG TAAGAAGAAA AGACAGTAAG AATGTAAGAA GAGAAGCACA ATTGATCCCA
GATACTCCCT CTCCTGGGTG TGCGGAGGTG GGCTTATGGC ATTGTTAG
 
Protein sequence
MSIPSFSTQF LSVENLSVSF KTSKGLVKAN ENISFEIKEG EIFGLIGETG CGKTTLGKAL 
LRLLSNNARI EGRIVYRGKN ILSLSEKEMR SLRGKEIGIM LQDPSVCFNP VLSIGSQIAE
IYRYHEGMRK KDAKKKASEM LELVGIDSSR KSEYPHQFSG GMLQRVMIAV ALALKPRLLI
ADEPTKGLDP DMKLQILEII TKLVRKENSS MLLITHDLDV ATKLTDRTAV MYAGEIVEIG
KTATVISDPK HPYTFALLHS LPEKGLMTVL GQSPSLISPP SGCRYHPRCS NQLADCSKIH
PELLEHVDDH FVRCLLSEKN SKNVRRKDSK NVRREAQLIP DTPSPGCAEV GLWHC