Gene M446_4739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4739 
Symbol 
ID6134862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5209060 
End bp5210340 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content66% 
IMG OID641644876 
ProductABC transporter substrate-binding protein 
Protein accessionYP_001771503 
Protein GI170742848 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.640775 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACT CGACGAAGCA GACGGGCCTG ACCCGGCGCG GCCTCCTCGC GGGCGCGGCC 
GCCGGCGCCG CCGCGGGCAG CGGCGCCGTC ACGGGCTTTC CCTCCATCAT CGCGGCCGAG
CCCGTGACCC TGCGCTATCT CGGCACCGCC GTGAACCAGA GCGCCGACAT CGCCCGCAAG
GTGAAGGAGG ATCTCGGCAT CACCATCGAG TACATACCGG TCGTCACCGA CGAGGTATCG
AAGCGGGTCG TCACTCAGCC GAACTCCTTC GACATCGTCG ATTCCGAGTA TTTCAGCCTC
AGGAAGCTGA TCCCGTCCGG CAACCTGGTC GGCATGGACG CCCGCCGGAT CAAGTACGCC
GACAAGATCA CCACCGTGTT CACCAAGGGC GAGCTGAACG GCAAGGCGAT CGGCGACCAA
GGCACCGCGC CCAAGAAGGT GTTCTACCTC GAAGGCCAGA CCGCGACGAA GTTCGCCGGC
GCGCCGACCG AGTGGATCAC CCTGATCCCG ACCACCTACA ACGCCGACAC GCTCGGCATC
CGGCCCGACC TGATCAAGCG GCCGATCTCG AGCTGGAAGG AGCTGCTCAA CCCCGAATTC
AAGGGCAAGG CCTCGATCCT CAACATCCCG TCGATCGGGA TCATGGACGC CGCCATGGTG
GTCGAGGCGA TGGGCGAATA CACCTATCCC GACAAGGGCA ACATGACCAA GGCCGAGATC
GACCGCACCA TGAAGGTGCT GATCGAGGCC AAGCGGGCCG GGCAATTCCG GGCCTTCTGG
CAGGATTTCA ACGAATCCGT GAACCTGATG GCCTCCGGCG AGACGGTGAT CCAGTCGATG
TGGTCCCCCG CCGTCACCAA GGTGCGCTCG CAGGGCGTGT CCTGCACCTA CCAGCCGCTG
AAGGAGGGCT ACCGGGCCTG GGCCGCCGGT TTCGGACTCC CCAAGACCCT GACCGGCCGG
AAGCTCGACG CAGCCTACGA TTTCATCAAC TGGTTCCTGT CGGGCTGGGC CGGCGCCTAC
CTCAACCGCC AGGGCTACTA CTCGGCCGTG CTGGAGACCG CCAAGGCCGA GATGGAGCCC
TACGAGTGGG CCTACTGGAT GGAGGGCAAG CCGGCCGAGA AGGACATCAA GGCCCCGGAC
GGCACCGTGC TGGAGAAGGC CGGCGCCCTG CGCGACGGCG GCTCCTTCGC CGAGCGCATG
GGCGCGGTGG CGTGCTGGAA CTCGGTCATG GACGAGAACA CCTACATGGT CCGCAAGTGG
AACGAGTTCA TCGCCGCATG A
 
Protein sequence
MTDSTKQTGL TRRGLLAGAA AGAAAGSGAV TGFPSIIAAE PVTLRYLGTA VNQSADIARK 
VKEDLGITIE YIPVVTDEVS KRVVTQPNSF DIVDSEYFSL RKLIPSGNLV GMDARRIKYA
DKITTVFTKG ELNGKAIGDQ GTAPKKVFYL EGQTATKFAG APTEWITLIP TTYNADTLGI
RPDLIKRPIS SWKELLNPEF KGKASILNIP SIGIMDAAMV VEAMGEYTYP DKGNMTKAEI
DRTMKVLIEA KRAGQFRAFW QDFNESVNLM ASGETVIQSM WSPAVTKVRS QGVSCTYQPL
KEGYRAWAAG FGLPKTLTGR KLDAAYDFIN WFLSGWAGAY LNRQGYYSAV LETAKAEMEP
YEWAYWMEGK PAEKDIKAPD GTVLEKAGAL RDGGSFAERM GAVACWNSVM DENTYMVRKW
NEFIAA