Gene M446_5467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5467 
Symbol 
ID6131820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5997754 
End bp5998698 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content74% 
IMG OID641645601 
Productaliphatic sulfonate ABC transporter periplasmic ligand-binding protein 
Protein accessionYP_001772217 
Protein GI170743562 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.221886 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACGCA GAGACCTCCT CGCCGGCGCC TCGGCGCTCC TCGCCGCCGG CGGCCTTCCC 
GCCCGCGCGG CCGCGGCCCT GCCCAGGGAA CTGCGCCTCG GCTTCCAGAA ATCCGGCCTG
TTCGTCTCGG CCCGCCAGCG CGGCGTCTAC GAGGCGCATT TCCGGCCGCT CGGCGTCCCG
GTGCGCTGGG TCGAGTTCCA GTTCGGCCCG CCCATGCTGG AGGCGCTGAA CCTCGGCGCC
ATCGACTTCG CCACGGTGGG CAACGCCCCG CCGATCTTCG CCCAGGCCGC CTCCGGCAAC
CTCCTGTACG TGGCCGCCCA GGAGGCGGGC GGCGAGGCCG TGATCGTGCC CGAGGGCTCG
GGGCTGCGCA GCCTCGCCGA CCTCAGGGGC CGCACGGTCG GGGTGCCCAA GGGATCGAGC
GCCCACGCCA CCCTGGTGGC GGCGGTCGAG AAGGGCGGCC TCGGCTGGGG CGACATCAAC
CCGGTCTACC TCGCCCCCGC GGACGGCGTC GCGGCCTTCG CCCGCGGCGC GATCGACGCG
TGGTCGATCT GGGATCCCTA CCTGGCGATC GCGGAGGGCA AGGGGGCCCG CGTCCTCGCC
CACAACCACG AGGTGGCGAA CCCGCACAGC TTCTACCTCG CCAACCGGGC CTTCGCCGAG
ACCTACCCGG AGGTGGTCGG TCAGATCGCG GACGTGCTGG CGCGGGAAGC CGCCTGGGCC
GAGGCCAACC GCGACGCCTA CGCGCGGACG TTGCACGAGG CGCAGGGCAT CCAACTCGAG
GTCGAGGCGG CGATCGTCGC CCGCACCCGC TTCCGGATCA AGCCGATCGA CGAGGCGGTC
CTGGACGGCC AGCAGGCCAC CGCCGACCGC TTCCACCGCC TCGGCCTGAT CCCGCGCGCG
ATCCGGGTCC GCGACATCGC CTGGGCCTGG ATCCCCAAGG CCTGA
 
Protein sequence
MRRRDLLAGA SALLAAGGLP ARAAAALPRE LRLGFQKSGL FVSARQRGVY EAHFRPLGVP 
VRWVEFQFGP PMLEALNLGA IDFATVGNAP PIFAQAASGN LLYVAAQEAG GEAVIVPEGS
GLRSLADLRG RTVGVPKGSS AHATLVAAVE KGGLGWGDIN PVYLAPADGV AAFARGAIDA
WSIWDPYLAI AEGKGARVLA HNHEVANPHS FYLANRAFAE TYPEVVGQIA DVLAREAAWA
EANRDAYART LHEAQGIQLE VEAAIVARTR FRIKPIDEAV LDGQQATADR FHRLGLIPRA
IRVRDIAWAW IPKA