Gene M446_1048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_1048 
Symbol 
ID6131781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp1163520 
End bp1164554 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content75% 
IMG OID641641341 
Producthypothetical protein 
Protein accessionYP_001768013 
Protein GI170739358 
COG category[R] General function prediction only 
COG ID[COG2984] ABC-type uncharacterized transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGGGC CGTTCGCGCC GGCGATCCCG GCCGCGGCGC GCGCGCCCGT GAGGCGGGCG 
ATCCGGCGGC GCGACCTGAT CCCGCTCTGC GCCGCGGCGG CGGCGTGGCC CCTCGGCGCG
CGCGCGGAGC GCGGTCCGCG CCGGATCGGA ATCCTCGTCA CGGGGTCGCC GGACTCGCAC
GGCGCCTTCG TGGCGGCGTT CCGGCGGCGG CTAGCGGAAC TCGGCCACGC CGAGGGACGG
GACGTCGCCT TCGACCTGCG CTGGAGCGAA GGCCGGATCG AGCGCCTGGG GCCCCTCGCG
GAGGATCTCG CGCAGCTCGC CCCGGACCTC GTGGTGACCT CGACGACCGC CGCGGCCCTG
GCCGCCAAGC GCGTCATGCC GGAGCGCCCG ATCGTGTCCG CGACCCTGAT CGACCCGATC
GGCGCCGGGC TGGTGACCAG CCTCGCCCGC CCGGGCGGCA CCGTCACGGG CATGCTGATC
AGCTTCGAGA CCCTCCTCGG CAAGCAGCTC GAAGTGGCCC GCGAGATGCT GCCGGGCGTC
ACGCGGATCG GGATGCTGGT CAACCCGGCC AATCCGGTGA TCCCGTTCCA GCGCGAGAAC
ACGCAGGCCT ATGCGGACCG GCTGCGGGCG CGGCTGATCC CGGTCGAGGC CCGCTCCCCG
GCGGACCTCG ATGCCGCCTT CGCGACCTTC GCGCGGGACT CCGCCGGCTT CGTGATCGTG
CTGCTGGACG CGCTGTTCAT CACCCACCGC GCGCGGATCG CCGAACTCGC CCTCGCGTCG
CGCGTCCCGA CCGTCGCGGG CGCGCGCGAG TTGGCGGAGG CGGGCGGCCT CGTGAGCTAC
GGGATCGACC TGAGCGCGAC CTGGCGCCAG GCGGCCGCCT TCGCGGACCG CGTCCTGCGC
GGCGCCAGGC CGGCGGACCT GCCTGTCGAG CTTCCGACCA AGTACGAACT CGTGCTCAAT
CTCGGGGCCG CTTCGCGCTT CGGGATCACG GTCTCGACCA TGCTGCTCGC CCGCGCCGAC
ACGGTCATCG AGTGA
 
Protein sequence
MPGPFAPAIP AAARAPVRRA IRRRDLIPLC AAAAAWPLGA RAERGPRRIG ILVTGSPDSH 
GAFVAAFRRR LAELGHAEGR DVAFDLRWSE GRIERLGPLA EDLAQLAPDL VVTSTTAAAL
AAKRVMPERP IVSATLIDPI GAGLVTSLAR PGGTVTGMLI SFETLLGKQL EVAREMLPGV
TRIGMLVNPA NPVIPFQREN TQAYADRLRA RLIPVEARSP ADLDAAFATF ARDSAGFVIV
LLDALFITHR ARIAELALAS RVPTVAGARE LAEAGGLVSY GIDLSATWRQ AAAFADRVLR
GARPADLPVE LPTKYELVLN LGAASRFGIT VSTMLLARAD TVIE