Gene M446_2964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_2964 
Symbol 
ID6130852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3286601 
End bp3288322 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content76% 
IMG OID641643155 
Productheparinase II/III family protein 
Protein accessionYP_001769810 
Protein GI170741155 
COG category[S] Function unknown 
COG ID[COG5360] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.534511 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.040363 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCACTGGG GGCCTGACCG CTGGCGCTTC TACCGGCTCG TCGGGCGCGA GATCGGCCGC 
GCCGCCCGCT CCGCCATCCT CCACCTCTCC TCGGCGCCCT CGGCCTTCGC GCGGCTGCGG
CCGACCGGCC TCGTCATCGC GCCCCAGGAC CTGCGCACCA GCGACGCCAC CCTGGCGGGC
GACATCTACG CGGGCCTGTT CGTGTTCGCC GGCCGGGCCC TCGCCACGGG CGGGCGCTCG
CCCTTCGACT TCACCCCGCC CTCGCCCGAA TGGGGGGAGG CGCTCTACGG GTTCGGCTGG
CTGCGGCACC TGAGCGCCGC CGACAGCGCG CTCGCCCGCT CCAACGCCCG CGCCTTCGTG
CAGGACTACC TCGCCGGGCG CGGCGACCGC GACATCGCGG ACCGGCCCGC CGTCGCGGCC
CGCCGGCTGA TCTCGCTGCT CAGCCAGTCG CCCCTCGTCC TCGAAGGGGC GGACCACGCC
TTCTACCAAG CCTTCCTGAA GGGGATCGGG CGCTGCGTGC GCCGCCTCGA CACGGCCCTG
AGGCACGGCA TCCCGCCGCG CCAGCGCCTC GCCGCCGCGG TCGCGCTGAC CTATGCGGGC
CTGTGCTGCG ACGGGTTCGA GCCGGTGCTG CGCCGGGGCG TGCGCCTCCT CTCCCGGGAG
CTCACCCGGC AGATCCTGGC GGATGGCGGC CATCGCAGCC GCGACCCGGC GGCCGTGCAG
GAACTCCTGC TCGACCTGCT GCCCCTGCGC CAGAGCTTCC TCAGCCGCGA CATCGCCCCG
CCCGATCCGC TGCTCGTCGC GATCGACCGG ATGCTGCCCT TCCTGCGGCT GCTGCGCCAC
GGGGACGCCT CCCTCGGCCA CCACAACGGC ATGGGGGCGA CGGAGGCCGA CCAGCTCGCG
ACCCTGCTGA TCTACGACGG GGCGCTGGCC CGCCCGCTGA TGCACGCGCC CCACACCGGC
TACGCGCGGC TGGAGGCGGG CCGCACGCTC CTCGTCGCGG ATATCGGCCC CGCGCCGCCG
CTGGTCTACT CGACCGTGGC CGGGGCGGGC TGCCTGTCCT TCGAACTGTC GAGCGGCGCC
CAGCGCCTCG TGGTCAATTG CGGCATGCCG GCGAGCGGCG ACGAGATGCG CCAGCTCGCC
CGCACCACGG CGGCGCACTC GACCGCGGTG CTCGGAACCG CCTCGTCCTG CCGCTTCCTC
AACCCGCCCG GGCCCGGCTT CGCGCACCCG ATCGCCGCCT GGCTGCGCCA CCGGATCGGG
CCGGTGATCC TGCGCGGGCC GCTCCGGGTC CCGGTCGAGC GCGGCAGCGC GCCGGACGGC
ACGGCGGTGC TCGCGGCGAG CCACGACGGC TACCTCGCGG GATTCGGGCT GATCCATGAG
CGGCGCTGGC GCCTCGCGCC GGCGGGCGAC CTCCTGGAGG GCGAGGACGC CTTCCGGCGG
CCGGAGCGGG AGGGGCGCGC CCGCGGCGCC GGCCGCCCGG TCGAGGCGGC GATCCGCTTC
CACCTGCATC CCTCCCTGCG GGTGAGCCGG GAGGGCCGGG CGGTGCGCCT GCAGGGCGCG
GGCGGCGAGG CGTGGCTGTT CGAGACCGAG GCGACCGATC CGGTGATCGA GGAGAGCGTC
TTCTTCGCGG CCTCGAACGG CGCCCGGCGG ACCGAGCAGA TCGTGCTGCA CCTCAAGGTC
GCGGACGGCA CCCGGCTGGC GTGGCGCTTC CGGCGCGCGT GA
 
Protein sequence
MHWGPDRWRF YRLVGREIGR AARSAILHLS SAPSAFARLR PTGLVIAPQD LRTSDATLAG 
DIYAGLFVFA GRALATGGRS PFDFTPPSPE WGEALYGFGW LRHLSAADSA LARSNARAFV
QDYLAGRGDR DIADRPAVAA RRLISLLSQS PLVLEGADHA FYQAFLKGIG RCVRRLDTAL
RHGIPPRQRL AAAVALTYAG LCCDGFEPVL RRGVRLLSRE LTRQILADGG HRSRDPAAVQ
ELLLDLLPLR QSFLSRDIAP PDPLLVAIDR MLPFLRLLRH GDASLGHHNG MGATEADQLA
TLLIYDGALA RPLMHAPHTG YARLEAGRTL LVADIGPAPP LVYSTVAGAG CLSFELSSGA
QRLVVNCGMP ASGDEMRQLA RTTAAHSTAV LGTASSCRFL NPPGPGFAHP IAAWLRHRIG
PVILRGPLRV PVERGSAPDG TAVLAASHDG YLAGFGLIHE RRWRLAPAGD LLEGEDAFRR
PEREGRARGA GRPVEAAIRF HLHPSLRVSR EGRAVRLQGA GGEAWLFETE ATDPVIEESV
FFAASNGARR TEQIVLHLKV ADGTRLAWRF RRA