Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_3388 |
Symbol | |
ID | 4644998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 3605074 |
End bp | 3606255 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639806866 |
Product | extracellular solute-binding protein |
Protein accession | YP_954191 |
Protein GI | 120404362 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.645484 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGTG AGATCGACCC GCAACTGTTG GCCCGACTGA ACGCGCGCCG GACCTCCCGC CGCCGGTTCA TCGGTGGCGG CGCCGCAGCC GCCGCGGGCC TGGCCCTCGG TTCGTCGTTC CTGGCGGCGT GCGGGTCCGA CAGTGGAACG TCGAGCACCA CCTCGGAGGC CAGTGGCCCC GCCAGCGGCA CCCTGCGCAT CTCGAACTGG CCGTTGTACA TGGCCGACGG TTTCGTCGCC GCATTCCAGA CCGCCTCCGG CATCACCGTC GACTACAAAG AGGACTTCAA CGACAACGAG CAGTGGTTCG CCAAGGTCAA GGAGCCGTTG TCGCGCAAGC AGGACATCGG CGCCGACCTG GCCGTTCCGA CGTCGTTCCT TGCGGTGCGG CTGCATCAGC TCGGCTGGCT CAACGACATC AGCGACGAAG GTGTGCCGAA CAAGAAGAAC ATCCGTCCGG ACCTGCTCGA GGCCAGCGTC GACCCGGGCC GCAAGTTCAG CGCCCCGTAC ATGTCGGGCC TGGTCGGCCT TGCCTACAAC CGCGCCGCCA CCGGCCGCGA CATCAAGACG ATCGACGACC TGTGGGATCC GGCGTTCAAG GGCCGGGTCA GCCTGTTCTC CGACGCCCAG GACGGCCTCG GCATGATCAT GCTCTCGCAG GGCAACTCGC CGGAGAACCC CTCCATGGAG TCGGTCCAGA AGGCGGTCGA TCTGGTCCGT GAGCAGAACG ACAAGGGCCA GATCCGCAGG TTCACCGGCA ACGACTACGC GGACGACCTT GCTGCGGGCA ACGTCGCCGT GGCACAGGCG TATTCGGGTG ACGTGGTCCA GCTTCAGGCG GACAACCCCG ATCTGCAGTT CATCGTTCCG GAGTCCGGTG CGACGACATT CGTCGACACG ATGGTGATCC CCTACACGAC GCAGAACCAG AAGGCCGCCG AGGCGTGGAT CAACTACGTA TACGACAGGG CCAATTACGC GAAGCTGGTG TCGTACGTCC AGTACGTTCC GGTGCTGTCG GACATGACCG AGGAACTGGA GAAGATCGAT CCGGCCGCTG CGGCCAACCC ACTGATCAAC CCGCCCGCCG ACGTCCTGGC GAAGTCCAAG GGCTGGGCCG CACTCACCGA CGAGCAGACG CAGGAGTACA ACACCGCGTA CGCCGCCGTC ACCGGCGGCT GA
|
Protein sequence | MSREIDPQLL ARLNARRTSR RRFIGGGAAA AAGLALGSSF LAACGSDSGT SSTTSEASGP ASGTLRISNW PLYMADGFVA AFQTASGITV DYKEDFNDNE QWFAKVKEPL SRKQDIGADL AVPTSFLAVR LHQLGWLNDI SDEGVPNKKN IRPDLLEASV DPGRKFSAPY MSGLVGLAYN RAATGRDIKT IDDLWDPAFK GRVSLFSDAQ DGLGMIMLSQ GNSPENPSME SVQKAVDLVR EQNDKGQIRR FTGNDYADDL AAGNVAVAQA YSGDVVQLQA DNPDLQFIVP ESGATTFVDT MVIPYTTQNQ KAAEAWINYV YDRANYAKLV SYVQYVPVLS DMTEELEKID PAAAANPLIN PPADVLAKSK GWAALTDEQT QEYNTAYAAV TGG
|
| |