Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_3451 |
Symbol | |
ID | 6129875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 3835989 |
End bp | 3837725 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641643618 |
Product | ABC transporter related |
Protein accession | YP_001770270 |
Protein GI | 170741615 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0600] ABC-type nitrate/sulfonate/bicarbonate transport system, permease component [COG1116] ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.90461 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00285782 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGCTCG TGGCCGACCG CACCGCCCCG CCGGAAGCCG CGCTCCCGGA GCCCCAAAGC GGCGAGGCGC CCGCCGCCCT GCCGGCGCCG CTCCTGCGGA TCGCGGGCGT CTCCCTGGAA TACCGCACGC CCGAGCGGGT GGTCCGGGCG ACGCACCGCA TCGACCTCGA CATCCACCGG GCCGACCGCT TCGTGCTCCT CGGCCCCTCG GGCTGCGGCA AGTCCACCCT GCTCAAGGCC GTGGCGGGCT TCCAGACGCC GGTCGAGGGC GAGATCGTCC TCGACGGGCG GCCGGTGCGC GGCCCGGGGC CGGACCGGAT CGTGGTGTTC CAGGAGTTCG ACCAGCTGCC GCCCTGGAAA ACGGTGCTGC AGAACGTCGC CTTCCCGCTG CGGGCCTCCC GCACCCTCGG CCGGCGCGAG GCGGAGGCGC GGGCGCGCCA CGTCATCGAC AAGGTCGGCC TGTCGCGCTT CGCCGGCAGC TACCCCCACC AGCTCTCGGG CGGGATGAAG CAGCGCGTCG CCATCGCCCG GGCGCTCGCC ATGGAGCCGA AGGTGCTGCT GATGGACGAG CCCTTCGCGG CCCTCGACGC GCTGACCCGC CGCCGCATGC AGGAGGAGCT GCTGGCGCTC TGGGACGAGG TCCGCTTCAC CCTCCTGTTC GTCACCCACT CGATCGAGGA GGCGCTGGTG GTCGGCAACC GCGTGGCCGT GCTGTCGCCG CATCCGGGCC GGGTGCGGGG CGAGTTCAAC AGCCACGCCT TCGACCTCGC GAGCGTCGGC AGCGCCGCCT TCCAGGCCCG CCGTGCAGCG CCTGCACCAG CGGCTGTTCG ACTCATCGCC CGCCGCAGGA GCACCCGCCC CCCGATGAAC GCGCCGCTCC TCCCCCCGAT CCGCCCGGAA TACGAGCGGG CGCTGCCGCC CTTCGTCGAG GCGCCGGTGT CGCGGGACCT GCCGCTCGCC GCCCGGATCG GGCAGACCGG CGCCCTGCGC CGGGGCCTGA TCCTGCTCGC CCTCGCCCTC GCCTGGGAGG GCCTCGCCCG CTGGCAGGAC AACGACCTGC TGCTGCCGGG CTGCCTCGCC ACCCTCTCGG CCCTCGCCGA GGGGCTGGCG AGCGGCGAAC TCGTCGCGCG CGCGCGGATC TCGCTCGGCG TGCTCGCCCA GGGCTACGCG GCGGGGGTCG GGCTCGCCTT CCTGCTGACG ACGCTCGCGG TCTCGACGCG GGCGGGCCGC GACCTGCTCA CGACGCTCAC CGCGATGTTC AACCCGCTGC CTGCCATCGC GCTGCTGCCG CTGGCCCTGC TCTGGTTCGG CCTGGGCAAC GGCAGCCTGA TCTTCGTCCT GATCCACGCG GTGCTGTGGC CGCTCGCGCT CAACACCTTC GCGGGCTTCC AGGGCGTGCC CGAGACCCTG CGGCTCGCGG GCCGCAATTA CGGGCTCACC GGCCTCGCCT ACGTCTGGCA GATCCTGATC CCCGCCGCCC TGCCGGCGAT CCTGTCCGGG TTGAAGATCG GCTGGGCCTT CGCGTGGCGC ACCCTGATCG CGGCGGAACT CGTCTTCGGC GCGGCGTCCG GGCGGGGCGG CCTCGGCTGG TACATCTTCC AGAACCGCAA CGAGCTGTAT ACGGATCGGG TCTTCGCGGG CCTGCTCCTG GTCATCGCGA TCGGCCTCGC GGTCGAGACG ATCGTGTTCG CGACGGTCGA GCGCGCGACC ACGCGGCGCT GGGGTATGGT CCGGTAG
|
Protein sequence | MTLVADRTAP PEAALPEPQS GEAPAALPAP LLRIAGVSLE YRTPERVVRA THRIDLDIHR ADRFVLLGPS GCGKSTLLKA VAGFQTPVEG EIVLDGRPVR GPGPDRIVVF QEFDQLPPWK TVLQNVAFPL RASRTLGRRE AEARARHVID KVGLSRFAGS YPHQLSGGMK QRVAIARALA MEPKVLLMDE PFAALDALTR RRMQEELLAL WDEVRFTLLF VTHSIEEALV VGNRVAVLSP HPGRVRGEFN SHAFDLASVG SAAFQARRAA PAPAAVRLIA RRRSTRPPMN APLLPPIRPE YERALPPFVE APVSRDLPLA ARIGQTGALR RGLILLALAL AWEGLARWQD NDLLLPGCLA TLSALAEGLA SGELVARARI SLGVLAQGYA AGVGLAFLLT TLAVSTRAGR DLLTTLTAMF NPLPAIALLP LALLWFGLGN GSLIFVLIHA VLWPLALNTF AGFQGVPETL RLAGRNYGLT GLAYVWQILI PAALPAILSG LKIGWAFAWR TLIAAELVFG AASGRGGLGW YIFQNRNELY TDRVFAGLLL VIAIGLAVET IVFATVERAT TRRWGMVR
|
| |