Gene M446_1701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_1701 
Symbol 
ID6132631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp1914095 
End bp1917037 
Gene Length2943 bp 
Protein Length980 aa 
Translation table11 
GC content72% 
IMG OID641641959 
Product5'-nucleotidase domain-containing protein 
Protein accessionYP_001768628 
Protein GI170739973 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.507606 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAATT ACACGCTCCA GCTCCTGCAC ATGTCGGACG GCGAGGGCTC GACCCTCTCG 
ACCCAGACCG CCCCGGTGAT GGGCGCGCTG ATCGACCGCT TCGACGGCCA GTACGCCAAC
ACGCTGGTGC TCGCCGGGGG CGACAATTAC ATCCCCGGCC CGTTCCTCAC CGCGGGCGCC
GACCCCGCGC TGAACCGCGT CGTCGGCGCC ACGGCCCTCG GCCGGCCCGA CGTCGCGATC
TACAACGCCT ACGGGGTCAA GGCCTCGGCC ATCGGCAACC ACGAATTCGA TCTCGGCTCG
CAGACCGTCG CGGACGCGAT CACGCCCTCG GGCGCCTGGG GCGGCGCGCG CTTCCCCTAT
CTCTCGGCCA ATCTCGACTT CTCGGGCGAC GCGGCCCTGC GCGGCCGGGC GACCGCGGGC
GGGCAACCCT CCTCGGCGAT CGCCGGCCGC ATCGCGCCCT CGACCATCGT CACCGTCAAC
GGCGAGCGCA TCGGCGTCGT CGGCGCCACG ACCCAGCTCC TGGAGCGGAT CTCCTCGCCG
ACCGGCACGG AGGTGAACGG ATTCCCCAAG GCGGGCGAGC CCGGCGACAA CCTCGTCGAG
CGCGACGACA TGGCGCTGCT GGCGAGCCAG CTCCAGCCGG CCATCGACGC GCTCGTCGCC
CAGGGCGTCA ACAAGATCGT CCTCCAGAGC CACCTCCAGC TCCTCTCGAA CGAGCAGGCC
CTGGCGCCGC TCCTGCGCGG GGTCGACATC ATCCTGGCGG CGGGCTCGCA TACCCGCCTC
GGCGACGCGA CCGACGTGCA GGCCGCCTTC CCGGGCCACG ACGCCACCTC CGCCGGTCCC
TACCCGATCG TCACCGCGGG GGCGGACGGC GCCCCGACCA TGATCGTCGC CACCGACAGC
GAATCGACCT ATCTCGGCCG CCTCGTGGTC GATTTCGACG AGAACGGCCG CATCGTCCCG
GGCAGCCTCA ACCCGGCGGT GAGCGGCACC TACGCGGCGA CCCAGGCGAC CCTGCAGGCC
GCCTACGGCG CCGACATCGC CCGGGCCTTC GCGCCCGGCT CGATCGGCGC CCGGGTGAGG
GAGGTCACCG ACGCGGTCGG CCAGGTGATC AGCACCAAGG AGCGCAACGT CTACGGCTTC
ACGGGCGTCT ACCTGGAGGG CGACCGGGCC TTCGGGCGCG CCCAGGAGAC CAATCTCGGC
GACCTCTCGG CGGATGCCAA CGCCGCCGCG GCGGCCCGCG CCGTCGCCAG CCAGCCCTAC
CTCGTCTCCC TGAAGAACGG CGGCGGCATC CTCGCCTCGA TCGGCGCGGT GAGCGGGGGC
AGCGGCAACG ACCCGAATGC CGGCGCCAAG CTGCCGCCCC TGGCCAATGC CGAGGCCGGC
AAGCCCGCCG GCGGCGTCTC GCAGCTCGAC GTCGAGAACG CCCTGCGCTT CGACAACAAG
CTGATGGTGT TCGACACCAC CGCCCAGGGC CTCAAGAACA TCCTCGAATA CGGGGCCGGA
CTCGCGCCGG GCAACGGCGG CTATCCGCAG ATCGGCGGCG TCAAGTTCTC CTACGACCCG
TCGCAGCCGG CCGGGTCGAA GGTGCGCAGC ATCGCGCTGA CCGACGAGGC CGGCACGGTG
ATCGCCAAGG TGGTGGAGAA CGGCGCCGTT TTACCGAACG CCCCCGCCCG CATCAGCGTC
GTCTCACTGA ACTTCACGGC AAATGGTGGC GACGGCTATC CGACCAAGCA GAACGGCGAC
AATTTCCGCT ACCTGCTCGG CGACGGCCGC CTGTCGGCGC CGGTCGACAA GTCCCTCGAC
TTCACGGCGC CGGCCAACGT GCCGGCGAAC GTGGTCGGCG AGCAGGCCGC CTTCGCGCAG
TACATGCAGG CGCGCTACGG CACGGCGGAC AAGCCCTACG GCGCGGTCGA CACCGCGTCC
TCGGAGGATC TGCGCATCCA GAACCTGAGC GTCCGCGCCG ACACGGTGTT CAATTCCGGC
CGTTTCGGCA CCGGCGGCGG CGACAGCCTC GCGGGCACGG CCGGCACCGA CGAGATCTTC
GGCTATGCCG GCAATGACAC CCTGGCCGGC GGTCAGGGGA ACGATTCCAT GCTGGGCGGG
GACGGGAGCG ACCTCGTCTC GGGCCAGGAC GGGGACGATG CGGTCCTGGG CGGGGCGGGC
AACGACTTCG TGTCGGGCGG TGCCGGAAAT GACAGCGTCA ATGGCGAGGC CGGCGACGAC
CTCGTCTTCG GCGACGAGGG CGACGACATC GTCGACGGCG GCGCGGGCAA CGACCGCGTC
TACGGCGGCA CCGGCGCCGA CCGGGTTTTC GGCTCGGCCG GCAGCGACCT CGTGTTCGGC
GAGCAGGGCA ACGACTTCGT CGGGGCCGGC GAGGGCAACG ACTTCGCGTC GGGCGGGGCC
GGCAACGACG AGGTCCACGG CGAATCCGGC GACGATTACG TATTCGGCGA CGAGGGCGAC
GACCAGCTCT TCGGCGGCGA GGGCCGCGAC AGCCTCTATG GCGGGCTGGG CAGCGACGTC
CTCGACGGGG ATGCGGGCGA CGACCTCCTG GCGGGCGAGC AGGGCGACGA CGTGCTCATG
GGCGGGGCGG GGAACGACTA CCTGTCGGGC GGCCTGGGCA GCGACCAGCT CTTCGGCGGG
GACGGGGCCG ACCTTCTGTT CGGCAACGCG GGCAACGACT CGCTGTCGGG CGGGCGGGGC
TCGGACATCT TCGCGTACGG GCGCGGGGAC GGCCAGGACG TGATCCGGGA CTTCACGGTG
GGCGGGTCGG AGCGGGACGT GATCGCCTTC AACGGCGGGG TGTTCACGTC CTTCGCGGCC
GTGCAGGCGG CGACCCAGCA GCTGGGGGCG GACGCGGTGA TCACGGTGGG GGCGGGTGAC
AGCCTGACCC TGCAGAACGT CCAGGTCGCC AGCCTCTCCG CCCAGACCTT CACCTTCGCC
TGA
 
Protein sequence
MANYTLQLLH MSDGEGSTLS TQTAPVMGAL IDRFDGQYAN TLVLAGGDNY IPGPFLTAGA 
DPALNRVVGA TALGRPDVAI YNAYGVKASA IGNHEFDLGS QTVADAITPS GAWGGARFPY
LSANLDFSGD AALRGRATAG GQPSSAIAGR IAPSTIVTVN GERIGVVGAT TQLLERISSP
TGTEVNGFPK AGEPGDNLVE RDDMALLASQ LQPAIDALVA QGVNKIVLQS HLQLLSNEQA
LAPLLRGVDI ILAAGSHTRL GDATDVQAAF PGHDATSAGP YPIVTAGADG APTMIVATDS
ESTYLGRLVV DFDENGRIVP GSLNPAVSGT YAATQATLQA AYGADIARAF APGSIGARVR
EVTDAVGQVI STKERNVYGF TGVYLEGDRA FGRAQETNLG DLSADANAAA AARAVASQPY
LVSLKNGGGI LASIGAVSGG SGNDPNAGAK LPPLANAEAG KPAGGVSQLD VENALRFDNK
LMVFDTTAQG LKNILEYGAG LAPGNGGYPQ IGGVKFSYDP SQPAGSKVRS IALTDEAGTV
IAKVVENGAV LPNAPARISV VSLNFTANGG DGYPTKQNGD NFRYLLGDGR LSAPVDKSLD
FTAPANVPAN VVGEQAAFAQ YMQARYGTAD KPYGAVDTAS SEDLRIQNLS VRADTVFNSG
RFGTGGGDSL AGTAGTDEIF GYAGNDTLAG GQGNDSMLGG DGSDLVSGQD GDDAVLGGAG
NDFVSGGAGN DSVNGEAGDD LVFGDEGDDI VDGGAGNDRV YGGTGADRVF GSAGSDLVFG
EQGNDFVGAG EGNDFASGGA GNDEVHGESG DDYVFGDEGD DQLFGGEGRD SLYGGLGSDV
LDGDAGDDLL AGEQGDDVLM GGAGNDYLSG GLGSDQLFGG DGADLLFGNA GNDSLSGGRG
SDIFAYGRGD GQDVIRDFTV GGSERDVIAF NGGVFTSFAA VQAATQQLGA DAVITVGAGD
SLTLQNVQVA SLSAQTFTFA