Gene M446_3967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3967 
Symbol 
ID6130966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4417220 
End bp4419478 
Gene Length2259 bp 
Protein Length752 aa 
Translation table11 
GC content68% 
IMG OID641644127 
Producthypothetical protein 
Protein accessionYP_001770767 
Protein GI170742112 
COG category[S] Function unknown 
COG ID[COG1944] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00702] uncharacterized domain
[TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.154728 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCTG ATCGCGGAGG TGCTGCGATT CGGCGCGACG GTCGGGAGAT TCCGCGGTTC 
GCACCGAATT TTTCCGTCTA CGTGCTGCCG CCCAACGTCG TCTGCCTCTA TTCGGAACAC
CGCAAGTTCT TCCTCGAAGG CGACCTCTAT GCCGCGCTCG CGCCGCAACT GGCTGCCGGG
AGGACTCTCA AGGCGATCTT CGACGCGCTG ACGCTCGACT TCCCGGCTCA GGCGATCACC
GAAGCCCTCA GGCGGCTGGT CGAGCGACGC TTCGTCCTGC TCGACCGCGT CGCGGCCGAC
GACGCGGCAG CCGCCTTCTG GGCGAGTCTC GGCCTGCCGC CCGGCGCAGC GGAGTGGAAC
CTCCGGAACT GCAGGGTGCG GATCCGCGCG CTCGACGTAT CGGGCGCCGA CGACCTCGCG
GCGGCGCTGA CCGGTCTCGG CGTGCGCGTG GTCAAGCGCA CCGCCGACCT AACGGTCGTG
CTGGTGAACG ATTACCTCGA TGCGCGCCTC GCGGAGACGA ACCGGCGGCA CCTGTCCGAC
GAGACGCCCT GGCTGATCGT CCAGCCCTCC GGGATCTTCC CGCTAGTGGG GCCTCTGCTC
GCTCCGGGAA AAGGCGCCTG CTGGACCTGC CTCGCCGACC GCATGCGGTG GAACCGCGAG
GTGAAGGCGT TTCTGGAACG GCAGCGGGCC GACCCGGTGG CGCTGTCGCC CCTGGTCCGC
AACCCGGTCG GGCAGAGCGC GACGCCGTTC GCAGCCATCG AGATCGCGAA GGCGATCGCC
ACCGACTTTC GCACGGACCT GCGCGATCAC GTGATCAGCT TCGACCTGAC GGGGGCGACG
ATCGCGCGGC ACTTCGTGGC GGCGCGTCCG CAATGCCCTG CATGCGGCCA TCCGGAGCAG
CGCGACCCGA ACCGTCCGGC GAACCCGCTC GAACTCAAGG CCGGCGGCAA GCTCGTGCTG
ACGAGCGGGG GATACCGAGC GCTGTCGCCC GAGGCCACGC TCGCGCGCTA CCGCAAGCAC
GTGAGCCCGC TCACGGGGGT CGTCTCGCGG CTGGAGCCGA TGGAAGCCGA CCTGTCCCTG
AACACCAGCT ACGTCGCGCG CCACAACTTC TCGCCGCGCC CCGAAACGGT CCCTGCGCTC
AAGGCCGGGT TGGGCAGCGA CAGCTACGGC AAGGGAAGCA CGAGCGAGCA GGCGCAGGCC
AGCGCCCTCA TGGAGGCGAT CGAGCGGTAT TCCGGGATCT TCCAAGGCGA CGAGATCAGG
GTGGTCCGGC GTTTCGACGA GTTCGGGGCC GGCGAGGCGA TCGCTCCGAA CGACGTCATG
CTCTTCAGCG ACGCGCAGTA CCGTCAGGGC CTCACGGGCG CGCACGATCA CGAATCCTTC
ATACCGGCTC CCTTCGACCC CTCCGCCGCG ATCGCGTGGT CGCCGGTCTG GTCGCTGCGG
GACGCAGCGT TCAAGTACCT GCCCACGAGC TTCCTGTACT ATTTCTGCAG GGGATCGGGC
CACGCCGAGA CCAGTGCGGA TTCGAACGGC TGCGCGGCCG GGAACACGAT CGAGGAGGCC
ATCGTGCAGG GCTTCCTCGA ACTCGTCGAG CGGGATGCGT ACGCGATCTG GTGGTACAAT
CGGCTGCAAA GACCGCCCCT GGACCTGGAT GCACTCGATG ACTCCTACAT TCGCGACCTG
CGCGCCCAAT TGACGGAGGC CGGACGCCGA TTGTGGGTGC TCGACATCAC CAACGATCTC
GGCATCCCCA GCTTCGTGGC GATCTCGCAC TGGACGGAGA ACGGTGAGGA GTGCGTCGAG
TTCGGCTCGG GCTCCCATTT CGACACGCGG ATCGCTGCGC TACGCGCCAT CACCGAACTC
AATCAGTTCT TCTCGATCGG GCTCATGGCG CGCCGGCACA GCGTGGATCC GGGTGACGAC
AGCGCCCATC GGTGGCGCCT CGACAACAAT CCGTACTTCG TGCCCGACGG TCAGCCGAGG
CTCCCACCGG ATTTCCGCTC CGGTTTCGCG CGCCTCGATC GCCGGGACCA AGTCCTCGCC
TGCGTGGACC TGATGGCGTC CCGCGGACTC GAATTCCTCG TCCTCGACCA GACGCGGCCG
GATATCGGCG TGCCGGTTGT CAAGGTGATC GTTCCCGGGA TGCGGCACTT CTACCCGCGT
TTCGGACCCG GCCGGCTCTA CGACGTTCCG CTCGCCCTCG GCTGGCTCGA CCGGAGCGTC
CCCGAGCGCG ACCTCAATCC GCTGTTCCCG CCGACCTGA
 
Protein sequence
MTADRGGAAI RRDGREIPRF APNFSVYVLP PNVVCLYSEH RKFFLEGDLY AALAPQLAAG 
RTLKAIFDAL TLDFPAQAIT EALRRLVERR FVLLDRVAAD DAAAAFWASL GLPPGAAEWN
LRNCRVRIRA LDVSGADDLA AALTGLGVRV VKRTADLTVV LVNDYLDARL AETNRRHLSD
ETPWLIVQPS GIFPLVGPLL APGKGACWTC LADRMRWNRE VKAFLERQRA DPVALSPLVR
NPVGQSATPF AAIEIAKAIA TDFRTDLRDH VISFDLTGAT IARHFVAARP QCPACGHPEQ
RDPNRPANPL ELKAGGKLVL TSGGYRALSP EATLARYRKH VSPLTGVVSR LEPMEADLSL
NTSYVARHNF SPRPETVPAL KAGLGSDSYG KGSTSEQAQA SALMEAIERY SGIFQGDEIR
VVRRFDEFGA GEAIAPNDVM LFSDAQYRQG LTGAHDHESF IPAPFDPSAA IAWSPVWSLR
DAAFKYLPTS FLYYFCRGSG HAETSADSNG CAAGNTIEEA IVQGFLELVE RDAYAIWWYN
RLQRPPLDLD ALDDSYIRDL RAQLTEAGRR LWVLDITNDL GIPSFVAISH WTENGEECVE
FGSGSHFDTR IAALRAITEL NQFFSIGLMA RRHSVDPGDD SAHRWRLDNN PYFVPDGQPR
LPPDFRSGFA RLDRRDQVLA CVDLMASRGL EFLVLDQTRP DIGVPVVKVI VPGMRHFYPR
FGPGRLYDVP LALGWLDRSV PERDLNPLFP PT