Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_3458 |
Symbol | |
ID | 6133104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 3842659 |
End bp | 3843696 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641643625 |
Product | ABC-type nitrate/sulfonate/bicarbonate transport systems periplasmic components-like protein |
Protein accession | YP_001770277 |
Protein GI | 170741622 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.304874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.001883 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCTTCA CCCGACGCAC CTTCCTCGGC GCCGCCCCGC TCGTCGGCGG CGCGGCGCTC GCCGCGGCGG GGCGCGCGCG GGCGCAGGGC CGGGCGGCGC CCCTGCGGGT CGGCATCATC CCGATCCTCG CCGCCGCGCC GATCTTCGTC GCCGAGAAGG AGGGCTGGCT CAAGGAGGCC GGTCTCACGC TCGCGATCAC CACCTTCGAA TCCGGGCCGA ACATGATCCA GGCCCTGGCC TCGGGCACGC TCGACGTCTA CGTGGCCGGC GTGGCGCCCC TCGGGGTGGC CCGCGCGCGC GGCATCGACG TCAAGGTCGT GACCGCGACC GCGGTCGAGG AGAACGTCTT CGTCGCGGGA TCGCGGCTCG CCCGCCACTT CGAGCCCGGC CTCGCGCCCG CGGAGGCGTT CCGGCGCTAC CGCGCCGCCG CCGGCGCGCC GGCCCGGCTC GCCACCCAGC CGCTCGGCTC GGTGCCCAAC ACCACCCTGC AGCACTGGCT GTGGGAGGTG GCCAAGGCCG ATCCCAAGGA CGTCACCCTC GTGTCGATGG GCATCGACGC CACCCAGCAG GCCATCCTGG TCGGCGCCGT CGAGGGCGGG ACCCTGCGCG AGCCGGCCGT GTCGATCGTC ACCGGCCGCG ATTCGGGCAT CCGCCTCGTC GCCCTGGGCG GCGCGATGTT CCCCGGCCAG CCCGGCACCG TGGTGGCCCT GACCCGGGCG CTCCTCGACC GGGAGCCCGA GGCCGCGCAG GCGGTGGTGA CCGGGATCGT CCGCGCCGTC GACCTGATCG GCCGCGAGCC CGGCCGCGTC GCGCCGGTGG TCGAGGCCGC CCTCGGCAAG GGACTCGTCG ACCTCGCCAC GATCCGCCGG GCCCTCGCCT CGCCGGCGAC GCGCTTCACC GCCGATCCGG GCGCGATCGT GGAGGCCACC GCCGCCATGC AGCGCTACCA GGTCAAGCTC GGCGCCCTCG ACCGCGAGGT CCCCCTCAAC GGCCTGTTCG AGCCGCGCCT CTACGCGCGC GCCGCGGCCT CCCGGTGA
|
Protein sequence | MTFTRRTFLG AAPLVGGAAL AAAGRARAQG RAAPLRVGII PILAAAPIFV AEKEGWLKEA GLTLAITTFE SGPNMIQALA SGTLDVYVAG VAPLGVARAR GIDVKVVTAT AVEENVFVAG SRLARHFEPG LAPAEAFRRY RAAAGAPARL ATQPLGSVPN TTLQHWLWEV AKADPKDVTL VSMGIDATQQ AILVGAVEGG TLREPAVSIV TGRDSGIRLV ALGGAMFPGQ PGTVVALTRA LLDREPEAAQ AVVTGIVRAV DLIGREPGRV APVVEAALGK GLVDLATIRR ALASPATRFT ADPGAIVEAT AAMQRYQVKL GALDREVPLN GLFEPRLYAR AAASR
|
| |