Gene M446_4818 details


Gene Information

Locus tag: M446_4818
Symbol:
ID: 6131249
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 5293160
End bp: 5294530
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 71%
IMG OID: 641644955
Product: ABC transporter nitrate-binding protein
Protein accession: YP_001771582
Protein GI: 170742927
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
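
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes the stop codon. The record itself does not state these conventions, and the function names are illustrative only.

```python
# Values copied from the Gene Information record.
START_BP = 5293160
END_BP = 5294530


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1371
print(protein_length(length_bp))  # 456
```

Under these assumptions both results agree with the Gene Length (1371 bp) and Protein Length (456 aa) fields.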


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.191179
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00821783
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCCCTGT TCGACGATCC CTTCGACGCG CGGCGGCGCC TGCGGCGGGG CGGTTGCGCC 
TGCGGCGCGC ACGAGAGCCA GGCCGCGCAC GACGCGGCCG CGGCCGCGGA GGCTCCGGCG
GAGGCGCGGG CCGAGCGGCT GGTGGAGGGC GCGGTGATGC GGGCGCTCTT CCCCCGCGAC
GCCACCCGCC GGGCCTTCCT GGCGGCGGTG GGGGCGGGGG CGGCGGCGGC CGCCCTGCGC
GAGGTGCTGC CGATCGGCTT CGTCACGGAG GCCTTCGCGC AGGCCGGCGC GCCGGAGCGG
AAGGACCTCA AGGTCGGCTT CATCCCGATC ACCTGCGCGA CGCCGATCAT CATGGCGGCG
CCGATGGGCT TCTACGCCAA GCAGGGCCTC GCCGTGGAGG TGGTGAAGAC CGCCGGCTGG
GCCGTCATCC GCGACAAGAC CCTGAGCAAG GAGTACGACG CCGCCCACAT GCTCGCGCCG
ATGCCGATCG CGATCTCGCT CGGCATCGGC TCGACCCCGC AGCCCTACAC GATGCCGGCG
GTCGAGAACG TCAACGGGCA GGCGATCACC CTCTCGGTGA AGCACAAGGA CCGGCGCGAT
CCCAAGTCCT GGAAGGGCTT CAGGCTCGCG GTGCCGTTCG ACTACTCGAT GCACAATTAC
CTGCTGCGCT ACTACCTGGC GGAGCACGGC ATCGACCCGG ACACCGACGT GCAGATCCGG
GCCGTGCCGC CGCCCGAGCT GGTCGCCAAC CTGCGGGCAG AGAACATCGA CGGGTTCCTG
GCGCCGGACC CGGTCAACCA GCGCGCGGTC TACGACGGGG TCGGCTTCAT CCACCTCCTC
TCGAAGGAGA TCTGGGACCG GCATCCCTGC TGCGCCTTCG CGGCCTCGCA GGCCTTCGCC
ACCGAGACGC CCAACACCTA CGCGGCCCTG CTGCGGGCGA TCATCGAGGC GACCGCCTAC
GCCTCGAAGC CGGAGAACCG CAAGGAGATC GCGGCCCAGA TCGCGCCGGC CAACTACCTC
AACCAGCCCG TGACGGTGGT GGAGCAGGTG CTCACCGGCA CCTTCGCGGA CGGGCTCGGC
AGCGTGCGCA GGGTGCCCGA CCGGATCGAT TTCGACGCGT TCCCGTGGCA CTCCTTCGCG
GTCTGGATCC TCACCCAGAT GAAGCGCTGG GGGCAGGTCA AGGGCGACCT CGACTACCGG
GCGGTGGCCG AGAAGGTCTA CCGCGCCACG GACGCCGCCA AGCTGATGGC GCAGGCCGGG
CTCAACCCCC CCGCCGCCAC CTCGAAGACC TTCGTGGTCA TGGGCCGGAC CTTCGACCCC
GACAGGCCCA AGGAGTACCT CGACTCCTTC GCCATCAGGC GCGCGAGCTG A
 
Protein sequence
MALFDDPFDA RRRLRRGGCA CGAHESQAAH DAAAAAEAPA EARAERLVEG AVMRALFPRD 
ATRRAFLAAV GAGAAAAALR EVLPIGFVTE AFAQAGAPER KDLKVGFIPI TCATPIIMAA
PMGFYAKQGL AVEVVKTAGW AVIRDKTLSK EYDAAHMLAP MPIAISLGIG STPQPYTMPA
VENVNGQAIT LSVKHKDRRD PKSWKGFRLA VPFDYSMHNY LLRYYLAEHG IDPDTDVQIR
AVPPPELVAN LRAENIDGFL APDPVNQRAV YDGVGFIHLL SKEIWDRHPC CAFAASQAFA
TETPNTYAAL LRAIIEATAY ASKPENRKEI AAQIAPANYL NQPVTVVEQV LTGTFADGLG
SVRRVPDRID FDAFPWHSFA VWILTQMKRW GQVKGDLDYR AVAEKVYRAT DAAKLMAQAG
LNPPAATSKT FVVMGRTFDP DRPKEYLDSF AIRRAS
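
The gene and protein sequences above can be related with a small translation routine. This is a minimal sketch, not the method used to generate this record. It relies on translation table 11 sharing the standard code's codon-to-amino-acid assignments (the two differ in permitted start codons). It also assumes the blocks of ten characters are separated only by whitespace. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same assignments and differs in permitted start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence as displayed above.
print(translate("ATGGCCCTGT TCGACGATCC CTTCGACGCG"))  # MALFDDPFDA
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the reported GC content.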