Gene M446_1431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_1431 
Symbol 
ID6132117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp1574168 
End bp1575166 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content71% 
IMG OID641641711 
Productextracellular solute-binding protein 
Protein accessionYP_001768381 
Protein GI170739726 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.633528 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCGCC GCAGCTTCCT GGCCGGCGCC ACCGCGCTCG GCCTCGCCGC GCCCGCCCGC 
GCCGAGACCG TCAAGGTCCG CATCGGCGTC GGCGGCAAGC CGCTCCTGTA CTACCTGCCG
CTCACCATCG CGGAGAAGAA GGGCTACTTC GTCGAGGAGG GGGTGGAGGC CGAGATCAAC
GATTTCGGCG GCGGCGCCCG CTCGCTCCAG GCGCTGATCG GCGGCTCGGT CGACGTGGTG
ACGGGGGCCT ACGAGCACAC GATCCGCATG CAGGCCAAGG GACAGGACGT GCGGGCGGTG
TGCGAACTCG GCCGCTACCC GGCGATCGTG ATCGCGGTGC GCAAGGACCT CGCCGGGACG
GTGCGGGGCC CGGGCGATCT CAAGGGCCGC AAGATCGGCG TGACGGCGCC CGGCTCCTCG
ACGGCGCTCG CGGTGCAGTA CGCGATGATC AAGGCGGGGC TGAAGGCCAC GGACGCGCCG
CTCATCGGCG TCGGCGGCGG GGCGGGCGCC ATCGCGGCGA TGAAGAAGGG CGAGATCGAC
GCGATCTCCC ACCTCGACCC GGTCATCGCC AAGCTGGAGG CGGACGGCGA CATCGCCGTG
ATGATCGACA CCCGCACGGA GGCCGGGACC CGGGCGCTGT TCGGCGGGCC GAACCCGGCG
GCGGTGGTCT ACACCAAGCA GGAGTGGATC GAGCGCCACG CCGCCGCGAC CCAGAAGGTG
GTCAACGCCT TCGCGAAATC GCTGAAGTGG CTCGCCGCCG CCACTCCCGA GGAGGTCGCC
GACACGGTGC CGCCCGCCTA CCATTTCGGC GACCGGCCGC TCTACGTGCA GGCGGTGAAG
AACTCGCTCG AGAGCTATTC CCGCACCGGC ATCCCCTCGC AGGAAGGCAT GGCGAGCGTG
CTCGACCTCG TGCGCACCCT CGATCCGGAG CTGCAGGGCG CCAAGATCGA CCTCGCGGCG
ACGCTGGAGG ACCGCTTCAT CCGCAAGGCG ATGGGCTGA
 
Protein sequence
MDRRSFLAGA TALGLAAPAR AETVKVRIGV GGKPLLYYLP LTIAEKKGYF VEEGVEAEIN 
DFGGGARSLQ ALIGGSVDVV TGAYEHTIRM QAKGQDVRAV CELGRYPAIV IAVRKDLAGT
VRGPGDLKGR KIGVTAPGSS TALAVQYAMI KAGLKATDAP LIGVGGGAGA IAAMKKGEID
AISHLDPVIA KLEADGDIAV MIDTRTEAGT RALFGGPNPA AVVYTKQEWI ERHAAATQKV
VNAFAKSLKW LAAATPEEVA DTVPPAYHFG DRPLYVQAVK NSLESYSRTG IPSQEGMASV
LDLVRTLDPE LQGAKIDLAA TLEDRFIRKA MG