Gene Information |
Locus tag | Mvan_1172 |
Symbol | |
ID | 4646561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 1244661 |
End bp | 1248023 |
Gene Length | 3363 bp |
Protein Length | 1120 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639804670 |
Product | hypothetical protein |
Protein accession | YP_952013 |
Protein GI | 120402184 |
COG category | [S] Function unknown |
COG ID | [COG4913] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
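The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are inclusive, 1-based positions and that the stop codon is counted in the gene length but not in the protein length. Both assumptions agree with the listed values but are not stated in the record itself. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start/end coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon (assumption).
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information table.
length = gene_length_bp(1244661, 1248023)
print(length, protein_length_aa(length))
```

With the values from the table, this reproduces the listed Gene Length (3363 bp) and Protein Length (1120 aa).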
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.381954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACTGAGC AATTCCACCT GTCGCGGCTC CAGGTCATCA ACTGGGGCGT GTTCGACGGC TACCACGACA TCCCGTTCAG CGAGGGCGGC GCACTCATCG CGGGCGCCTC GGGCAGCGGC AAATCCTCAC TGCTGGACGC GATCTCGCTC GGCTTCCTGC CGTTCAACCG ACGCAACTTC AACGCCTCCG GCGACAACAC CGCAGCAGGA TCCAGCGCGG GCCGCCGCAC CGTCGACAAG TACGTGCGCG GCGCGTGGGG CCAGCGCAGC GACGGCGGCA CCAGCCGGGT GATGTACCTG CGCGGCGACG GCACCGCCTG GTCGGCGGTG GCCGTCACCT ACGCCGGCGA CTCCGGACGC ACCGTGACGG GCCTGGTGCT CAAGTGGCTG ACCGGTGAAT CGCGCAACGA CTCGTCGAGC CGTTTCGTAC TCGGCGACGG CGACCTCGAC ATCGAGGACG TCTGCAACCG TTGGGCTGCA GGACGATTCG ACACCGGCGT GTTCAAGGAA GACGGCTGGC GGTTCACCAC CAAGGTGGAG TCGCAGTACC TGGCGCAGCT GTACGCGACC ATCGGCATCC GCGCCTCGGA TGCGGCCCAA CAGCTGCTCG GCAAGGCTAA ATCGCTGAAA AGCGTTGGTG GACTGGAACA ATTCGTCCGC GAGTTCATGC TCGACGAGCC CGAGAGTCTG ACCCGGCTGC CGGAGGCGCT CAAGCAGATC GACCCGCTGG TGGAGGCGCG CGAACTGCTG GCGGTCGCGC AGAAGAAGCG CAAGATCCTC GGCGACATCG AGAAGATCCA GCAGCGCTAC GCATCCGAGT CCACCGATCT CGGCATCATC GACCTGGTCG ACCTGCCGAT GGTGCGCGCC TACACCGACC ATGTCCGGGT GGCGCAGTGC CCGGCGCAGA TCGCGCAGCT CGACACCACG ATCGATCAGC TCGACAATGA GTACGAGGAC GTCACCCGAA GCCTGAATCT GGCCAAGGCC GAAGCAGATT CGCTCAACGC GCAGATCAGC GGGTCCAGCG CGAGCATCGG TCCGCTGCAG TCGCAGGTGA CCGCCGCCGA GACCGAGGCC GAGCAGGTGT CGCGCCGGCG CGGCGCGTAC GAGGACATGC TCGCCGCCCA GCAGCTCGAC GTGCCGGAGA CGGCCGACGA CTTCTGGAAC CTGCGCGAAG AACTGCTCGC CCAGGCCACC GAACTGCTGG CCAAGGTGGA GCGCAACCGT GAGGCCTCCA CCGACGCCGA GTACGCGCAG AAGTCGGCGC GGATGGCCCG CGACGAGGCC GCCAAGGAAC TCAAACGTGT CGAGCACGTC GGCTCGGCGC TGCCGGAGTT CGCGCTGACC ATGCGGGAAC AGATCTGCAA TGCGGTCGGT GTCGACTCCA CCGACCTGCC GTATGTCGCC GAACTGATGG ATCTCAAACC GGACCAGACC CGCTGGCGCA CCGCGGTGGA GAAGGTGCTG CGCGGTGTCG GACTGCGGCT GATGGTGCCC GATCAGCACT GGACAAAGGT GCTGCAGTTC GTCAACGAGA CGAACATGCG GGGACGGCTG CAGCTGCACC ATGTGCGGGC GAAGTTCCTC GGCGCCGAGC CGGTCGATCC GGAGCCGAAC ACGTTGGCGG CCAAGCTGTT CGCGGTCGAC CCGGCCCACC CGTGCGCCGC CGAGGCCGTC GACGTGGTCA CCGCCGCCGG CGACCACGTC TGCGTCGACA CCCCCGAGGT GTTCGCCCGG TTCCGCCGCG CGGTCACCGA CACCGGCCTG TACAAGGATT CCGACCGGCT CGCGATCAAG 
GACGACCGCC GCCCACTCAA GCAGTCCGAG TACCTGTATC AGGGTGACGT GTCGGCGAAG ATCAACGCAC TGACCGTCGA CCTGGCCGCG GCCGAGGAGG CCTATCAGAA GGCGCGGCGC GTCGCCGACG ACATCGCCGC GCAGCGCCAG ACCTGGCGGG ACCGGGCCGC GGCGTGCAAG GCGATCTGCG AGCAGTTCCC GCAGTGGAGC CAGATCGACA CCGAGACCGC CGACGGGCAC GCCGACCGGC TGCGCGAGCA GTACGAGCTG CTGCTGGCCG ACCACCCCGA CATCGAGGCG CTCAACGCCC GCGCCGACGA ATGCTGGTCG CAGATCCAGA AATTGATGAC GCGCCGGGGT GCGATCCAGA CCCGCCGCGA CGACCTCGAC TCCCGCCGTA CGCAGCTCCT CGAACTTCAG GAGCGGCTGC AGCCGGCATT CGTCTCGGAG CCGCTAACCG ACCTGCTGAG CCGCTACGCC AACCAGGTGC CGGTGAGCCT GGAGCTGCTG GACCCGGAGC CGCACCGCGA TGCGTTGTTC ACCGCGATCA AGAAGGAACG CGAACAGCTG CGCGAGAGCC GGCGCCGCTC CTACGACGAG CTGGCCCGCA TCCTCAACAC GTTCGACACC TCGTTCCCGG ACGCGATCCC TAACGACTCG GACAACTTCG ACGAGCGGGT GCACGACTAC GTCGCGCTGT GCCGGCACAT CGACGAGCGG GAGCTGCCCG AGGCCTACGA GCGGATGATG CGTCTGGTCA CCGAGCAGGC GCCGGATGCG ATCCTGACGC TGCACCGGGT GGCCGAGCAG GAAACCCGGC GGATCAGTGA CCAGATCGAC CGTGTCAATA CGGGTTTGGG ATCGGTGGAG TTCAACCGCG GCACCCGGCT GACGCTGCGG GCCACGCCGC GCAGCCTGAC GGCGGTGTCC GAGTTGACCG AGATCGTGCG GGCCATCTCG CGGCGCATCG CCGAGGTCGG GCTCGGCGAC AAGCAGGCGA TCCTGGATCA GTACGCCGAC ATCCTGCGGC TGCGTAACCG GCTGGCGTCG ACGGCGCCGG AGGACAAGGC GTGGACCCGC GACGCGCTCG ACGTGCGCAA CCGGTTCACG TTCGACTGCG CCGAGTGGGA TGTCGCCAGC GAGGAGCTGA TCCGCACGCA CTCCAACGCC GGCGACAACT CCGGCGGCGA GCAGGAGAAG CTGATGGCGT TCTGCCTGGC CGGTGCGCTG AGCTTCAACC TGGCCAGCCC CGACAGCACC GACAACCGGC CGGTGTTCGC GCAGCTGATG CTCGACGAGG CGTTCTCCAA GTCGGATCCG CAGTTCGCGC AGCAGGCACT GCAGGCGTTC CGCAAGTTCG GGTTCCAGCT GGTGATCGTC GCGACGGTGC AGAACGCGAC GACGATCCAG CCCTACATCG ACAGCGTGGT GATGGTGTCC AAGACCGAGG CGACGGGCCG CAACGCACGT CCGGTGGCGA CGGTGGCGAC GCGCACGATC TCCGAATTCG GCGAGCTGCG CCGCGAGATG CGGGCCGGCG CGAAGGTGCC CGCCCCGGCC TGA
|
Protein sequence | MTEQFHLSRL QVINWGVFDG YHDIPFSEGG ALIAGASGSG KSSLLDAISL GFLPFNRRNF NASGDNTAAG SSAGRRTVDK YVRGAWGQRS DGGTSRVMYL RGDGTAWSAV AVTYAGDSGR TVTGLVLKWL TGESRNDSSS RFVLGDGDLD IEDVCNRWAA GRFDTGVFKE DGWRFTTKVE SQYLAQLYAT IGIRASDAAQ QLLGKAKSLK SVGGLEQFVR EFMLDEPESL TRLPEALKQI DPLVEARELL AVAQKKRKIL GDIEKIQQRY ASESTDLGII DLVDLPMVRA YTDHVRVAQC PAQIAQLDTT IDQLDNEYED VTRSLNLAKA EADSLNAQIS GSSASIGPLQ SQVTAAETEA EQVSRRRGAY EDMLAAQQLD VPETADDFWN LREELLAQAT ELLAKVERNR EASTDAEYAQ KSARMARDEA AKELKRVEHV GSALPEFALT MREQICNAVG VDSTDLPYVA ELMDLKPDQT RWRTAVEKVL RGVGLRLMVP DQHWTKVLQF VNETNMRGRL QLHHVRAKFL GAEPVDPEPN TLAAKLFAVD PAHPCAAEAV DVVTAAGDHV CVDTPEVFAR FRRAVTDTGL YKDSDRLAIK DDRRPLKQSE YLYQGDVSAK INALTVDLAA AEEAYQKARR VADDIAAQRQ TWRDRAAACK AICEQFPQWS QIDTETADGH ADRLREQYEL LLADHPDIEA LNARADECWS QIQKLMTRRG AIQTRRDDLD SRRTQLLELQ ERLQPAFVSE PLTDLLSRYA NQVPVSLELL DPEPHRDALF TAIKKEREQL RESRRRSYDE LARILNTFDT SFPDAIPNDS DNFDERVHDY VALCRHIDER ELPEAYERMM RLVTEQAPDA ILTLHRVAEQ ETRRISDQID RVNTGLGSVE FNRGTRLTLR ATPRSLTAVS ELTEIVRAIS RRIAEVGLGD KQAILDQYAD ILRLRNRLAS TAPEDKAWTR DALDVRNRFT FDCAEWDVAS EELIRTHSNA GDNSGGEQEK LMAFCLAGAL SFNLASPDST DNRPVFAQLM LDEAFSKSDP QFAQQALQAF RKFGFQLVIV ATVQNATTIQ PYIDSVVMVS KTEATGRNAR PVATVATRTI SEFGELRREM RAGAKVPAPA
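The gene sequence begins with TTG rather than ATG, yet the protein begins with M. This is expected under translation table 11, where TTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of that translation, not the database's own pipeline: the codon table is built from the standard amino-acid assignments (which table 11 shares with table 1), and `START_CODONS` is a deliberately reduced, illustrative subset of the initiators that table 11 allows. `translate_cds` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Standard amino-acid assignments in TCAG order; table 11 uses the same
# assignments as table 1 and differs only in its permitted start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a reduced subset of the table 11 initiation codons,
# sufficient for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
print(translate_cds("TTGACTGAGC AATTCCACCT GTCGCGGCTC"))
```

Applied to the first three blocks of the gene sequence, this yields MTEQFHLSRL, matching the first block of the protein sequence. The gene sequence is listed in coding orientation even though the gene lies on the minus strand, so no reverse complement is needed here.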