Gene M446_4520 details


Gene Information

Locus tag: M446_4520
Symbol:
ID: 6133815
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 4975169
End bp: 4976788
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 73%
IMG OID: 641644660
Product: ABC transporter related
Protein accession: YP_001771295
Protein GI: 170742640
COG category: [R] General function prediction only
COG ID: [COG4172] ABC-type uncharacterized transport system, duplicated ATPase component
TIGRFAM ID: [TIGR02323] phosphonate C-P lyase system protein PhnK
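
The length fields in this record can be cross-checked from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon; both assumptions are consistent with the values listed here. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4975169, 4976788)
print(length_bp, protein_length(length_bp))  # 1620 539
```

Both results agree with the Gene Length and Protein Length fields of the record.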


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.425757
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGCTC CGCTCCTCTC CGTCCAGGAC CTGTCCGTCG CCTTCCGCCA GGGCGGGCGG 
GAGACCCGCC CGGTCGACCG GGTCTCGTTC GAGATCCAGC CCGGCGAGAC ACTCGCGCTG
GTGGGCGAAT CCGGCTCCGG CAAGTCGGTG ACGGCGCTCA GCGTGCTGCG GCTCCTCGAC
GGCGCGACGC CCGAGGGGCG CATCCTGTTC AAGGGCCGCG ACCTCCTCGC CCTGCGCGAG
GCCGAGATGC GGGCGGTCCG CGGCGCCGAC ATCACCATGG TGTTCCAGGA GCCGATGACC
TCCCTCAACC CGCTCCACCG CATCCTCGAC CAGATCGGCG AGGTGCTGCG GCTGCACCGC
AAGGTGCCGG AGGCGGGCGT GCGGGCGCGG GTGCTCGAAC TCCTCGACCT CGTCGGGATC
CGCGACGCGG AATCCCGCCT CACCGCCTAC CCGCACGAAT TGTCCGGCGG CCAGCGCCAG
CGGGTGATGA TCGCCATGGC GCTCGCCTGC GAGCCCGACC TGCTGATCGC GGACGAGCCG
ACGACGGCGC TCGACGTCAC CGTCCAGGCC CAGATCCTGG CGCTGCTGGC CGACCTCAAG
GCCCGGCTCG GCATGGCGAT GCTGTTCATC ACCCACGATC TCGGGGTGGT GCGCCGGGTC
GCGGACCGGG TCTGCGTGAT GTTCCAGGGC CGGATCGTCG AGCGGGGCGA GGTGGCGCGG
GTCTTCGCCG ATCCGCAGCA CGACTACACC CGCCGCCTGC TCGCCGCCGA GCCGAGGGGG
CGGGGCAACC CGGTGGCGGC GTCCGCCGAG ACGCTGGTCG AGGCAGGCCC GATCCGGGTC
TGGTTCCCGA TCCGCCAGGG GCTGCTGCGC CGCACGGTCG GCCACGTGAA GGCGGTGGAC
GGGGTCTCGG TCCGGGTGCG GGCGGGCGAG ACGGTCGGGG TCGTGGGCGA ATCGGGCTCG
GGCAAGACCA CGCTCGGCCT CGCGCTCCTG CGGCTCATCG GCTCCGACGG GCCGATCGTC
TATCTGGGCC GCCGCCTCGA CGGGCTCGCG CAGGGGGCGA TGCGGCCCCT GCGCCGCGAG
ATGCAGGTGG TGTTCCAGGA TCCCTACGGC TCGCTCTCGC CGCGCATGTC GGTGGCCGAG
ATCGTCGAGG AGGGGCTGCT GGTGCAGAAG GCGGGCCGCG GCGCCGCCGA GCGGCGGCGC
ATCGTGGCCC GGGCCCTGGA GGATGTCGGG CTCGACCCGG CCGCCATGGA CCGCTACCCG
CACGAATTCT CCGGCGGCCA GCGCCAGCGC ATCGCCATCG CCCGGGCCAT GGCGCTCGAT
CCGCGCTTCG TTGTCCTCGA CGAGCCGACC TCCGCCCTCG ACATGTCGGT GCAGGCGCAG
ATCGTGACGC TGCTGCGGGA GCTTCAGCGC CGGCGCGGTC TCGGCTACCT GTTCATCAGC
CACGACCTCA AGGTCGTGCG GGCGCTCGCG AACCGGGTGG TGGTGATGCA GAACGGGCAG
GTGGTGGAGG AGGGCGAGGC CGAGGCGATC TTCACCGCCC CGCGGACCGC CTATACCCGC
GCCCTGTTCG CCGCCGCCTT CGACCTCGCG GCCGCTCCCG CCGGCGCGGT GCGCGAGTGA
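
The gene sequence is laid out in blocks of 10 bases, so it has to be cleaned before any computation. As a minimal sketch, the hypothetical helpers below strip the layout whitespace and compute the GC percentage; running `gc_percent` on the full sequence gives a value that can be compared with the GC content field of this record. Only the first line of the sequence is used in the demonstration.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used only as a short demonstration.
first_line = "ATGTCAGCTC CGCTCCTCTC CGTCCAGGAC CTGTCCGTCG CCTTCCGCCA GGGCGGGCGG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```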
 
Protein sequence
MSAPLLSVQD LSVAFRQGGR ETRPVDRVSF EIQPGETLAL VGESGSGKSV TALSVLRLLD 
GATPEGRILF KGRDLLALRE AEMRAVRGAD ITMVFQEPMT SLNPLHRILD QIGEVLRLHR
KVPEAGVRAR VLELLDLVGI RDAESRLTAY PHELSGGQRQ RVMIAMALAC EPDLLIADEP
TTALDVTVQA QILALLADLK ARLGMAMLFI THDLGVVRRV ADRVCVMFQG RIVERGEVAR
VFADPQHDYT RRLLAAEPRG RGNPVAASAE TLVEAGPIRV WFPIRQGLLR RTVGHVKAVD
GVSVRVRAGE TVGVVGESGS GKTTLGLALL RLIGSDGPIV YLGRRLDGLA QGAMRPLRRE
MQVVFQDPYG SLSPRMSVAE IVEEGLLVQK AGRGAAERRR IVARALEDVG LDPAAMDRYP
HEFSGGQRQR IAIARAMALD PRFVVLDEPT SALDMSVQAQ IVTLLRELQR RRGLGYLFIS
HDLKVVRALA NRVVVMQNGQ VVEEGEAEAI FTAPRTAYTR ALFAAAFDLA AAPAGAVRE
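
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is a plain-Python illustration rather than the database's own method: it builds the codon table from the compact 64-letter amino acid string in TCAG base order and translates up to the first stop codon. The names `CODON_TABLE` and `translate` are illustrative. Table 11 uses the same codon assignments as the standard code and differs in which codons may act as start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, first base varying slowest, in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (layout whitespace allowed) up to the first stop codon.

    Codons with ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGTCAGCTC CGCTCCTCTC CGTCCAGGAC"))  # MSAPLLSVQD
```

The output matches the first block of the protein sequence listed in this record.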