Gene Mbar_A3721 details

Gene Information

Locus tag: Mbar_A3721
Symbol:
ID: 3625011
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 4799988
End bp: 4801649
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 46%
IMG OID: 637702554
Product: sodium:proline symporter (proline permease)
Protein accession: YP_307164
Protein GI: 73671149
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
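
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4799988
end_bp = 4801649
gene_length_bp = 1662
protein_length_aa = 553

# Assumption: coordinates are 1-based and inclusive at both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not part of the protein.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks print True: 4801649 - 4799988 + 1 = 1662 bp, which is 554 codons, or 553 amino acids plus one stop codon.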


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCAGTCA GTACTTCAAC TCTAATCCTC CTTGTCATTA TTTACTTTAT GTGCACCTTC 
TACGTTGCCC GACTGGGGTA TAAGAAAAAT TCCCAGACCG ATGATGGGTA TATGCTTGCA
GGCCGACGTG TACCCCCGGC AATTATGGCT CTCTCCTACG GAGCTGCGTT CATCAGCACT
TCTGCAATTA TAGGATTTGG AGGAGTAGCC GCTTCCTCAG GAATGGGGCT TCTGTGGCTT
GTCTTTATGA ATATCTTTTT CGGAATTTTC ATTGCCTTCG TGATTTTTGG CCCAGGAACC
CGGCGCATGG GGCTAAACCT TGGAGCCATT ACTTACCCTG AATTCATAGG AAAACGTTTC
CAATCAAGAT TTATTCAAGC ATTTTCCGGT CTCCTTATAG CGGTCTTCAT GCCTCTTTAT
GCTGCAAGCG TGATCATAGG AGCAGGCAGA TTTCTTGAAA CCACGCTTGC GATAAACTAT
AATGTCGCTC TTTTGATCTT TATCGTCATC ATTGCTTTTT ATGTAATTAA AGGCGGCCTC
CTCTCAGTAA TGTACGTGGA TGCAATGCAA GCCACTATCA TGCTGATAGG AATGACCTTT
TTGCTGGTAT ATACCTACAG TAAACTGGGA GGAGTCGTAG AAGCCCACCA GGCTCTTACC
AATATGGCAA ACCTTGTGCC TCAGGCGCTT GTTGACCAGG GTCACAGGGG CTGGACTTCA
ATGCCAGCTT TTAATTCCTC AATCTGGTGG ACAATGGTTT CCACAATTGT TATGGGGGTA
GGGATAGGAG CACTTGCACA GCCACAGCTT GCAGTCAGGT TCATGACTGT AAAGGACGAC
CGTTCCTTGA AAAGAGCAGT TGCTGTGGGA GGTCCCTTCC TTCTTATGAT GGCAGGAGTT
ACATATGTTG TGGGAGCACT TTCTAATGTA TACTTTTACA GAACTACAGG AATGATCGCC
ACTCAGTTTG TCCCGGGTGG AAACACCGAC CTTATAATTC CCGCATACCT GAACCACGCA
ATGCCCGCGC TGTTTGTGGC AATATTCATG CTAAGCCTGC TTTCGGCAGC AATGTCTACT
GCAGCTGCCC AGTTCCACAC CATGGGTACA GCCATAGGAT ATGACTTTTA CCAGCACGGC
CTTATGAAAG GCAAGTCAAG TTCGAGCACG GTTCATGTTA CAAAAATAGG AATTGCCTTT
ACCATCGTGG TAGCAGTTAT TCTGGCTTAC GTTCTTCCTG GAAGTATTAT CGCAAGAACT
ACTGCCATGT TCATGGGACT GTGCACATCC GCTTTCCTGC CTCTCTATAT TGGAGCGCTG
TTCTGGAAAC GCACAACGAA AGCCGGAGCA ACTGCAAGTC TTGTTATAGG ATCGATAAGC
AGTCTTTTCT GGCTGGTCTT TGTCCATGCA AAAGAAGCAG TGTCTCTAGG AATCTGCCAG
GCAATCTTTG GAAAAGAAAC TTTACTTACA GGCACCTGGC CCCTTGTAGA TCCAATCATG
ATCGCAACTC CACTGTCTTT CCTTGTCCTG ATTGTAGTAA GCCTTATGAC ACCCCGTTTC
TCTCCAGAGT TTCTCAAAAA GGCTTTCAGG CTCAGGTTTG AAGATGAAGA CGAAGCATCG
GAAGCAACAT CAAACAGTGT AGCGGATTCG ACAGGCGTTT AA
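
The record lists a GC content of 46% but does not say how it was computed. As a hedged sketch, the helper below (a hypothetical function named `gc_percent`, not part of the source database) assumes the value is the fraction of G and C bases over all bases, and it strips the spaces and line breaks that group the sequence into blocks of 10 before counting.

```python
def gc_percent(sequence_text: str) -> float:
    """Return the percentage of G and C bases in a sequence given as text."""
    # Remove the spaces and line breaks used to lay the bases out in blocks of 10.
    bases = "".join(sequence_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above, used only as a small sample input.
sample = "ATGGCAGTCA GTACTTCAAC TCTAATCCTC CTTGTCATTA TTTACTTTAT GTGCACCTTC"
print(f"GC content of sample line: {gc_percent(sample):.1f}%")
```

Passing the full gene sequence text to the function gives a value that can be compared with the GC content field, subject to the rounding used by the database.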
 
Protein sequence
MAVSTSTLIL LVIIYFMCTF YVARLGYKKN SQTDDGYMLA GRRVPPAIMA LSYGAAFIST 
SAIIGFGGVA ASSGMGLLWL VFMNIFFGIF IAFVIFGPGT RRMGLNLGAI TYPEFIGKRF
QSRFIQAFSG LLIAVFMPLY AASVIIGAGR FLETTLAINY NVALLIFIVI IAFYVIKGGL
LSVMYVDAMQ ATIMLIGMTF LLVYTYSKLG GVVEAHQALT NMANLVPQAL VDQGHRGWTS
MPAFNSSIWW TMVSTIVMGV GIGALAQPQL AVRFMTVKDD RSLKRAVAVG GPFLLMMAGV
TYVVGALSNV YFYRTTGMIA TQFVPGGNTD LIIPAYLNHA MPALFVAIFM LSLLSAAMST
AAAQFHTMGT AIGYDFYQHG LMKGKSSSST VHVTKIGIAF TIVVAVILAY VLPGSIIART
TAMFMGLCTS AFLPLYIGAL FWKRTTKAGA TASLVIGSIS SLFWLVFVHA KEAVSLGICQ
AIFGKETLLT GTWPLVDPIM IATPLSFLVL IVVSLMTPRF SPEFLKKAFR LRFEDEDEAS
EATSNSVADS TGV