Gene Mext_3961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3961 
Symbol 
ID5835618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4402392 
End bp4403303 
Gene Length912 bp 
Protein Length303 aa 
Translation table11 
GC content67% 
IMG OID641369752 
Productextracellular solute-binding protein 
Protein accessionYP_001641403 
Protein GI163853360 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.323027 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCTGA GTCATGCGCT TTTCCTCGCC GCCCTCGCGA TTTCGGCGGC CACGGCACCG 
GTCGGCGCGC AGGAGTTGAG CGGAACCCTC AAGAAGGTGA AGGACACGGG CGCCATCACC
ATCGGCTATC GCGACGCCTC GGTGCCGTTC TCCTATCTCG ACGGCAATCA GAAGCCGGTG
GGCTACGCCT TCGAGATCTG TCTCAAGGTC GCCGACGCGG TCAAAGCGCA TCTGAAGCTC
GACACGCTGG AGGTGCGGCT CAACCCCGTC ACCTCCGCGA CCCGCATCCC GCTGATCGCC
AACGGGACGA TCGACCTCGA ATGCGGCTCG ACCACCAACA ACGCCGACCG GCAGCGGCAG
GCGGCCTTCA CCAACACCCA CTTCCTCACC GCGACACGCT TCGTCGCCAA GCGGGACAAG
GGGCTCGACA AGACCGACGA CCTCAAGGGC CGCACCGTGG TCTCGACCTC GGGCACCACC
AACATCCGCC AGATCAACGA GATCAACACC GCCCGGGGCC TCGGCATGCG GATCCTGCCG
GCCAAGGACC ACGCCGAGGC CTTCCTGATG GTCGAGACCG GCCGCGCCGA CGCCTTCGTG
ACGGACGACG TGCTGCTCGC CGCCCTCGTC GCCGGATCGA AGACGCCCGA CGCCTACGCG
ATCTCCTCGG AGGCGCAGTC GCGCCCCGAG CCCTACGGCA TCATGCTGCG CAAGGACGAC
CCGGCCTTCA AGGCCGTGGT CGATGCCGCC ACCGCCGCCC TCTACAAGAG CCCGGAGGGG
ACGGCGCTCT ACGAGAAGTG GTTCACGCAA GCCATCCCGC CGCGGGGCAT CAACCTGAAG
CTCCCGATGA GCGAGGCGAT GCGGAAGGCC TTGGCCAACC CCAGCGACAG CCCTGACCCG
GCGGCCTACT GA
 
Protein sequence
MHLSHALFLA ALAISAATAP VGAQELSGTL KKVKDTGAIT IGYRDASVPF SYLDGNQKPV 
GYAFEICLKV ADAVKAHLKL DTLEVRLNPV TSATRIPLIA NGTIDLECGS TTNNADRQRQ
AAFTNTHFLT ATRFVAKRDK GLDKTDDLKG RTVVSTSGTT NIRQINEINT ARGLGMRILP
AKDHAEAFLM VETGRADAFV TDDVLLAALV AGSKTPDAYA ISSEAQSRPE PYGIMLRKDD
PAFKAVVDAA TAALYKSPEG TALYEKWFTQ AIPPRGINLK LPMSEAMRKA LANPSDSPDP
AAY