Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_4818 |
Symbol | |
ID | 6131249 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 5293160 |
End bp | 5294530 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641644955 |
Product | ABC transporter nitrate-binding protein |
Protein accession | YP_001771582 |
Protein GI | 170742927 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.191179 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00821783 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCCTGT TCGACGATCC CTTCGACGCG CGGCGGCGCC TGCGGCGGGG CGGTTGCGCC TGCGGCGCGC ACGAGAGCCA GGCCGCGCAC GACGCGGCCG CGGCCGCGGA GGCTCCGGCG GAGGCGCGGG CCGAGCGGCT GGTGGAGGGC GCGGTGATGC GGGCGCTCTT CCCCCGCGAC GCCACCCGCC GGGCCTTCCT GGCGGCGGTG GGGGCGGGGG CGGCGGCGGC CGCCCTGCGC GAGGTGCTGC CGATCGGCTT CGTCACGGAG GCCTTCGCGC AGGCCGGCGC GCCGGAGCGG AAGGACCTCA AGGTCGGCTT CATCCCGATC ACCTGCGCGA CGCCGATCAT CATGGCGGCG CCGATGGGCT TCTACGCCAA GCAGGGCCTC GCCGTGGAGG TGGTGAAGAC CGCCGGCTGG GCCGTCATCC GCGACAAGAC CCTGAGCAAG GAGTACGACG CCGCCCACAT GCTCGCGCCG ATGCCGATCG CGATCTCGCT CGGCATCGGC TCGACCCCGC AGCCCTACAC GATGCCGGCG GTCGAGAACG TCAACGGGCA GGCGATCACC CTCTCGGTGA AGCACAAGGA CCGGCGCGAT CCCAAGTCCT GGAAGGGCTT CAGGCTCGCG GTGCCGTTCG ACTACTCGAT GCACAATTAC CTGCTGCGCT ACTACCTGGC GGAGCACGGC ATCGACCCGG ACACCGACGT GCAGATCCGG GCCGTGCCGC CGCCCGAGCT GGTCGCCAAC CTGCGGGCAG AGAACATCGA CGGGTTCCTG GCGCCGGACC CGGTCAACCA GCGCGCGGTC TACGACGGGG TCGGCTTCAT CCACCTCCTC TCGAAGGAGA TCTGGGACCG GCATCCCTGC TGCGCCTTCG CGGCCTCGCA GGCCTTCGCC ACCGAGACGC CCAACACCTA CGCGGCCCTG CTGCGGGCGA TCATCGAGGC GACCGCCTAC GCCTCGAAGC CGGAGAACCG CAAGGAGATC GCGGCCCAGA TCGCGCCGGC CAACTACCTC AACCAGCCCG TGACGGTGGT GGAGCAGGTG CTCACCGGCA CCTTCGCGGA CGGGCTCGGC AGCGTGCGCA GGGTGCCCGA CCGGATCGAT TTCGACGCGT TCCCGTGGCA CTCCTTCGCG GTCTGGATCC TCACCCAGAT GAAGCGCTGG GGGCAGGTCA AGGGCGACCT CGACTACCGG GCGGTGGCCG AGAAGGTCTA CCGCGCCACG GACGCCGCCA AGCTGATGGC GCAGGCCGGG CTCAACCCCC CCGCCGCCAC CTCGAAGACC TTCGTGGTCA TGGGCCGGAC CTTCGACCCC GACAGGCCCA AGGAGTACCT CGACTCCTTC GCCATCAGGC GCGCGAGCTG A
|
Protein sequence | MALFDDPFDA RRRLRRGGCA CGAHESQAAH DAAAAAEAPA EARAERLVEG AVMRALFPRD ATRRAFLAAV GAGAAAAALR EVLPIGFVTE AFAQAGAPER KDLKVGFIPI TCATPIIMAA PMGFYAKQGL AVEVVKTAGW AVIRDKTLSK EYDAAHMLAP MPIAISLGIG STPQPYTMPA VENVNGQAIT LSVKHKDRRD PKSWKGFRLA VPFDYSMHNY LLRYYLAEHG IDPDTDVQIR AVPPPELVAN LRAENIDGFL APDPVNQRAV YDGVGFIHLL SKEIWDRHPC CAFAASQAFA TETPNTYAAL LRAIIEATAY ASKPENRKEI AAQIAPANYL NQPVTVVEQV LTGTFADGLG SVRRVPDRID FDAFPWHSFA VWILTQMKRW GQVKGDLDYR AVAEKVYRAT DAAKLMAQAG LNPPAATSKT FVVMGRTFDP DRPKEYLDSF AIRRAS
|
| |