Gene M446_3446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3446 
Symbol 
ID6129870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3830333 
End bp3832486 
Gene Length2154 bp 
Protein Length717 aa 
Translation table11 
GC content72% 
IMG OID641643614 
ProductTonB-dependent heme/hemoglobin receptor family protein 
Protein accessionYP_001770266 
Protein GI170741611 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01785] TonB-dependent heme/hemoglobin receptor family protein
[TIGR01786] TonB-dependent hemoglobin/transferrin/lactoferrin receptor family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.494808 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00031135 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGATCGTCC GCTGCCTTCC GTCCCGCCGT TCCGGTCTCG CGGCCGGCCT CTCCCTCCTC 
GCCGTCGCGT CCGCTCAGGC GCGGGATGCG GCGGCGCCCG CCCCGGCGGC CCGGACCGAT
GCCCCGATCG CCCTCGACAC GATCTCGGTG ACCTCCACGA GGACCGAGGC GCCGGCCATC
GCCGCCCTGT CGGGGACGAG CGTGGTCACC CGCGCGCAGA TCGACCGGAT CCAGCCGAGC
CGCCTGTCCG ACCTGCTGCG CGACGTGCCC GGCGTGACGA CGCAGGAGAA CCAGAACGAT
CCGGCCCAGG CGGTCAACAT CCGCGGGCTG CAGGATTTCG GGCGGGTCAA CGTGCTGGTG
GACGGCGCCC GCCAGGATTT CCAGACCTCC GGGCACAGCG CCAACGGCGT GTTCTACCTC
GATCCGGAAC TCGTCGGCGG GGTCGACGTC ACGCGCGGGC CCGCCTCCAC CCTCTACGGC
TCCGGCGCGA TCGGCGGCGT GGTCGCGTTC CGCACGCGCG CGATCGACGA CATCCTGACG
CCTGACGAGA CGATGGGCGC GGTGCAGAGA TGGGGGTTCG GCGGCAACGG GCAGCGATTC
GTGAGCACGA CGGCGCTCGG CGCCCGCATC GGCCCGGCCG CGGACCTGTT CGGCCAGTTC
GTCTCGCGCC ACGCCGATCC CTACCGCGAC GGTTCCGGCA CCCTGGTGCG CGACACCGGC
AGCGCGCTGA CCGCGGGCCT CCTCAAGGCC AATCTGCGGC CCGCCGAGGG CCACGAGATC
AGCGGCACCG CCCTGATCCA GGATTTCGAC TTCGCCAATG CCGGCACCGG CCAGGCGGGG
GCCCGCTTCG CCAACAGGGT GCAGGCCGAC ACCTACACCC TCGGCTACCG GTTCGCCCGA
CCTGACGTGC CGCTGATCGA CCTCGACGCC AAGGTCTACC GGACCTCGAC GCACGACCTC
CTGACCTTCC TCTCCGACGC GCCGAGCGGC CTCTACGGCG GTCTCGGCGC GCGTCCGGGC
AATCGCATCG ACTACGACAT CGCGACCTCG GGCCTCGACA TCCACAACTC GGCCCGATTC
GACACCTGGG CGCTGAGCCA CCGGCTGACG CTCGGAGGGG ATTCCGTGCT CGACCGCGTC
GAGACGGACG ACCGGGCGGG CAGCTTCGGG GCCGCCTTCA CCCCGTCCGG CACCCGCAGG
CTCGCGGGAG CGTTCGTCCA GGATGAGGTG GGCTATTCCT CGTGGCTGCG CGTGGTCGGG
GCGCTCCGCT ACGACCGCTA CGCATTGTCC GGCGGCGCCG TGCACGCCCG CGGTGAGCGC
CTCTCTCCGA GGATCACCGT CGGCGTCACG CCCCTCACCG GAATCACGGT CTTCGGGACC
TACGCGGAGG GCTATCGCGC CCCCTCGATC ACCGAGGCCC TGGTGCAGGG CATCCACCCC
TTCCCGGCCT TCACCCTCCT GCCCAATCCG GCGCTCCGGC CGGAGGTGGC GCGCACCATC
GAGGGCGGCG TCAACCTCGC CTCCGGCGAT CTGCTTCGGC CCGGCGACAC GTTCAGGGCC
AAGCTCAGCG TGTTCTCGAC CGGCATCGCC GATTTCATCG CCGTCGGGCC GGTTGGCCCG
ACCTATCTCG TGCCGGCGCT TCCCGGCCTG CCGGCCTCGG CCTGCGCGCG CGGCGCGGGG
CGGTTTCCGT GCGTGATCCC GGTTCGGTCG CTGCAGTACC GCAACGTCGC CCGCGCGGAC
CTCTCCGGCG TCGAGCTGGA GGGCGCCTAC GATTGGGGCG GGGGCTTCGT CTCGCTCGCG
GCGACCCACA CCGACGGGCG CGACGCGGCG ACCCGCGAGA CGCTCCTGAC GGTGCCGCCG
GACCGGATCA GCGCGACCCT GGGACTGCGC TTCCTCGGCG AGCGCCTCAC GCTCGGGGGC
CGCCTCACCG TCGTGGAGGC GCGCCGGGCC GTGCCGGCCG GCGCGACCAT CCTGGCGACG
AAGGGCTGCG GCGTCGTCGA CCTCTTCGCC TGCTACCGCT TCGACGATCG CGTCCGGGCC
GACGTCATCG TCCAGAACGC GCTCGACAGG CGCTACACGC AATACCTCAA CCTGCTGCAG
AGCCCGGGCC TGACCGCCAA AGCCGCGCTC ACCATCGCGT TCGCCACGTG CTGA
 
Protein sequence
MIVRCLPSRR SGLAAGLSLL AVASAQARDA AAPAPAARTD APIALDTISV TSTRTEAPAI 
AALSGTSVVT RAQIDRIQPS RLSDLLRDVP GVTTQENQND PAQAVNIRGL QDFGRVNVLV
DGARQDFQTS GHSANGVFYL DPELVGGVDV TRGPASTLYG SGAIGGVVAF RTRAIDDILT
PDETMGAVQR WGFGGNGQRF VSTTALGARI GPAADLFGQF VSRHADPYRD GSGTLVRDTG
SALTAGLLKA NLRPAEGHEI SGTALIQDFD FANAGTGQAG ARFANRVQAD TYTLGYRFAR
PDVPLIDLDA KVYRTSTHDL LTFLSDAPSG LYGGLGARPG NRIDYDIATS GLDIHNSARF
DTWALSHRLT LGGDSVLDRV ETDDRAGSFG AAFTPSGTRR LAGAFVQDEV GYSSWLRVVG
ALRYDRYALS GGAVHARGER LSPRITVGVT PLTGITVFGT YAEGYRAPSI TEALVQGIHP
FPAFTLLPNP ALRPEVARTI EGGVNLASGD LLRPGDTFRA KLSVFSTGIA DFIAVGPVGP
TYLVPALPGL PASACARGAG RFPCVIPVRS LQYRNVARAD LSGVELEGAY DWGGGFVSLA
ATHTDGRDAA TRETLLTVPP DRISATLGLR FLGERLTLGG RLTVVEARRA VPAGATILAT
KGCGVVDLFA CYRFDDRVRA DVIVQNALDR RYTQYLNLLQ SPGLTAKAAL TIAFATC