Gene Mext_1971 details

Gene Information

Locus tag: Mext_1971
Symbol:
ID: 5831867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 2202775
End bp: 2204025
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 70%
IMG OID: 641367772
Product: Na+ dependent nucleoside transporter
Protein accession: YP_001639441
Protein GI: 163851398
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1972] Nucleoside permease
TIGRFAM ID: [TIGR00804] nucleoside transporter
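
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2202775
end_bp = 2204025
gene_length_bp = 1251
protein_length_aa = 416

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span // 3

print(span == gene_length_bp)           # coordinate span matches the gene length
print(span % 3 == 0)                    # a CDS should be a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus the stop codon
```

Under these assumptions all three checks agree with the record: 1251 bp is 417 codons, which is 416 amino acids plus one stop codon.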


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGACA GGTTGTTCCA TGCCGGCGCG AGCGTTGCCC TGCTGCTCGC CGTGGCGTGG 
CTGTTCTCGG TGAACCGGCG GGCGATCCGG CCACGGGTGG TGCTCGCCGC CCTGGCGCTT
CAGGTCGGGA TCGGCGCGCT GATGCTGTTC GTACCCGCCG GGCAGAGGGC GCTCGGCGCG
GTGGCGGATG TCGTCACCAC CGTGCTTTCC TTCGGCGACC GGGGCACCGC CTTCCTGTTC
GGCGGCCTCG TCGAGCCGCG GATGTTCGAG CTGTTCGGCG GCTCGGGCTT CATCCTGGCC
CTGCGGGTGC TGCCGCAGAT CCTCTACGTC TCGGCGCTGA TCGGCGTGCT CTACCATCTC
GGGGTGATGC AGGCGCTGGC CCGGTTTCTC GGCGCGGGTT TGCGAAAACT GCTCGGCACC
TCGCCGATCG AATCGTTCTC GGCGGTCGTC ACCATCTTCA TCGGGCAGAG CGAGATCGCC
GTGGCCCTGC GCCCCTTCCT CGCGGCGCTG ACCGGGGCCG AGCTGTTCGC GGTGATGACG
AGCGGGGCGG CCTCCACCGC CGGCTCGATC CTCGCCGGAT ACGCCGCGCT CGGCGTGCCG
ATGCCGTATC TTCTCGCCGC CTCGTTCATG GCGATTCCCG GCGGGCTGCT CTACGCCAAG
ATCCTCGTGC CCTCGACCGA GCCGACGCGC ATCCTCACGA CGCGTGTCGA GTTCGGCGAG
GCGCGGGCGG CCAACCTGAT CGAGGCCGCC GCCGGCGGCA CGCAGAAGGG CCTCGGCGTC
GCGGTCTCGG TCGGCGCCAT GCTGATCGCC TTCGTCGGGC TGATCGCGCT CGTGAACGCC
GGCATCGGCT GGGCCGGCGG CGTGTTCGGG TTCGCCGGCC TCTCGATCGA GGGCATTCTC
GGCGTCGTGC TGGCGCCGCT GGCCTGGCTC TTGGGCGTGC CATGGGAGCA GGCGACCCTC
GTCGGCGGCG CCATCGGCCA GAAGATCGCC TTCAACGAGT TCCTGGCCTA TGCCAGCCTC
TCGCCGATTC TGAAGGCCGG CACCCTCGAC CCGCGCACGA GCGCGATCCT GTGCTTCGCG
CTCTGCGGCT TCGCCAACCT CGCCTCGGTG GCGATTCAGC TCGCGAGCTT CACCAGTCTC
GCCCCCGAGC GCCGGCCCGA GATCGCCCGG TTCGGCCTGC GCGCGATCCT GGCGGGCACG
CTCTCGAACC TCACCAGCGC GGCCATCGCC GGATTGTTCA TCACCGGGTA A
 
Protein sequence
MLDRLFHAGA SVALLLAVAW LFSVNRRAIR PRVVLAALAL QVGIGALMLF VPAGQRALGA 
VADVVTTVLS FGDRGTAFLF GGLVEPRMFE LFGGSGFILA LRVLPQILYV SALIGVLYHL
GVMQALARFL GAGLRKLLGT SPIESFSAVV TIFIGQSEIA VALRPFLAAL TGAELFAVMT
SGAASTAGSI LAGYAALGVP MPYLLAASFM AIPGGLLYAK ILVPSTEPTR ILTTRVEFGE
ARAANLIEAA AGGTQKGLGV AVSVGAMLIA FVGLIALVNA GIGWAGGVFG FAGLSIEGIL
GVVLAPLAWL LGVPWEQATL VGGAIGQKIA FNEFLAYASL SPILKAGTLD PRTSAILCFA
LCGFANLASV AIQLASFTSL APERRPEIAR FGLRAILAGT LSNLTSAAIA GLFITG
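
The two sequences are printed in blocks of ten characters, so the spacing has to be removed before they can be compared. The sketch below is one way to do that and to translate the gene sequence; it is not the tool that produced this record. It assumes the standard codon assignments, which translation table 11 shares for amino acids (the tables differ in the start codons they allow, which this sketch does not model). The helper names `clean_sequence` and `translate` are hypothetical.

```python
# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGCTCGACA GGTTGTTCCA TGCCGGCGCG"
print(translate(clean_sequence(first_blocks)))  # MLDRLFHAGA
```

The output matches the first block of the protein sequence. Passing the full gene sequence text to the same two functions gives a string that can be compared with the cleaned protein sequence.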