Gene Information |
Locus tag | Mvan_2224 |
Symbol | |
ID | 4644946 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 2373747 |
End bp | 2376743 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639805708 |
Product | hypothetical protein |
Protein accession | YP_953044 |
Protein GI | 120403215 |
COG category | |
COG ID | |
TIGRFAM ID | |
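
The coordinates, gene length and protein length listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of this unspliced CDS is a stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Mvan_2224.
length_bp = gene_length(2373747, 2376743)
print(length_bp, protein_length(length_bp))  # prints "2997 998"
```

Under these assumptions the result agrees with the listed Gene Length (2997 bp) and Protein Length (998 aa).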
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.134781 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCAGCTC ATGCGCGACG CCGGCAGACC AGTGCAGCGA CGAAGGGGTC GCGTCGGGTG GACCGACAGT CACGGATCGA GCCCTATGCC TGGCTGGGAG CGGGTGCGGT CACGCTGGGC ATCGGGGCGG CTTTGGTCGG CGGAGCCGGT GTCGCGCACG CCGAGGACGG AGCGGACGGC GGCTCGCCGT CGACGTCCAC CTCGAGCTCG ACCTCCGAAA GCAACTCGAG CACTCAGCAC TCGGACGCCG CGGGCGATGG TGACACCGAC AGTGCTACGC AGCAGCCGAG GAGTGACCTG CCGTCGTCGG GCCGCGTGGA AGACCCAGAG CCGCAACAAG AGCCTGATCC GAACGGTGCA TCGAGCTCGG TGGTGGACTC CGCGGAATCG AGCGCGTCTG ACCCGGAACT CCCACCCGAT GATGGCGGGA CGGAATCCGA AGAGGCAGCG TCCGATTCGG CACCGGCACG TGCCGGCACG CGCACCGACA ATTCAGATGC TTCGACTCCG GTCGAGGGAG GCAGCGACGG CCGGAAGTCC GCTGCGCAAG CCGACGACCA CAGTGCGGAG CGGACCGCGT CACCAAACAC GGCGGCGACA GACGCGACGT CGAGCATTCC GCCGGAGGAC CCCCCGGTCG GCAACGTGGG CATCGACGTC GCGGAAGCAG AAGCTGACCC CGACGGCGCC GCGACGTCCT CGACGCAATG GCTTGCGGCG GCCACCTCGC AACACGAAGC GGCCGACACC ATGGTTGCGG CCATCGTTCC CATGCCCCCA GCCGGTGTCT CGACACAGAC GGTGAATCCG AGTCGCCCGG TCAACCGCTT CGCGCTGGCG GTCCACACTC TTCTGAAACC CATCGGGAAG CTTCTGACCG AGCTGGGGGG ACTCATCCTG GGCGCCCGCG CCGACCCGGT CCCTCCCGGC GCATGCCGGG ACGGCGTCTG CAATCCCACC ACTGACATCC CACTTCCCTG GGTTCGCAAC GAAGTCACCG TCCTGAACCT GACCGGCAGG CGGGTCACCC TGACCGACCT CGACACCAAG CTTCCGCTGA CCTACGGCCC GCAGGAAGGG TTCGTCCTCG AGAATGCGCG CCAGGTCGAG CTGGCGTTCT ACCAGGGCAA CAGTTACTGG GACGAAGACT GGACCTACGA GGGCACGATG AAGTGGTCGG ACGGATCCAC GATCGTCAGC GTGGACGTCG ACGCAGCCGG CGCCATTGCG TCGACCTCCG ACAGCGGCGT GCAGACCGTC ATCTTCGAGA CTCCGGAATC CATCGGCGGC CCGTCGCTCA TCTCACTCCG GCAGACGTTG GTGCTCCTGC CGGAGGCCGG CACCGTGATC ACCATCGACA GCACGGACCC AGTCGGGCAA GCGCTCGTCG CCTCGGCCCT GTGCAAGGTG TCGGGCGGCT GCACTCAGGA AGTCGTCGAC GAGCAGGTCA TGCTCAGCGC GCCGAAGCAG GTCGGCAATA CACTGTTCAA CGCCGGAAGC GTCGTCTCGA CCAACAGGTA CAAGGTCGTC CACGAGGTGA CCAAGACCAG CGGTGTCGAG GAAAACCTGA AGGTGGTCGC CGGCACCTCA TTGAATTTCT CCCTCGGGCC GCTCGCCTTC CAGACCGAAA TCGGCGCGCT GGTGCAACAG AAGTACGGGC ACAGCTGGAG CGACGGCATC ACCACCGAGT CCCAAGTGGA TCTGGATGTG CCGCCCGGAA GCTACGGCCA GATCTACGTG CAGTACCCCG AATATCACGA CTACGTCAAC ATGACGTTGA CCAATTCGGG TGTCACCATC 
ACGATTCCCG ACGTCGAGTA CGTCAGCCTG GCACCCTCGG GCGCCCTCGA CAAGGAGGGG AACCCGGTCG CGGTCACCTA CTCCACTGTC GACTGGGAGA TCGGGACCGG CCCACACCCC TACCCCGATT CCAATTCAGA GCCGACGCCT CCGGAGACCG CTGCCGTCGG CTTTGTGACC CCGCCCGACA TCGCGGCGCC CGCCCCGGCG CCCACTGCCA AACCGAAGTC CCTGTTCGGC ATCATCGGCG GCTATCTGCG GGATCAGGTT CGGGCGTTCC GCATGCTGCC GAACATCGTG TTCGCCGGCC GAGCCATCAG CTCACAGACG ATCCGGGTCG TCAACCTCAC GCCCTACGCG CAGACGCTCA GCAGCATCAC CGGCGAATAC GAGGAAGACG ACTCCCCGGA GAAGGGCTTT GTCCTGCAAC CCTTCCAGGA GATCGACATC CAGGTGGATT ACAACGTCTT CCAGGACCAA GAGACGTACG TGACGTGGAC CAACGCGACG GGTATCGCCG CCTCGGCGGA GCTGAAGGTT TTCAACGGGG GCAGTGCACC CCGGGTCACC TGCCAGAGCG ACGGTTGCAT GGCGGGCAGC TACGACCGCG ACAGCGCCGT CATGTACCTG ATCTACCCGT ATGACACACC GGGCCGAATG GACGTCACGA ACGACCCGAA CCTCGCGGCT GCAGCGGTCG ACGTGGCGTG CGCGCCGGGC TATGACGACG CGATGCCCGG CTCGTGCGGG GTGAACGTCA CCGGCGACAC CTACTACAAC GCGCCGACAT CGGGACCCGT CCAGCAGCAG AACAACTTTG GCTCCCAGAC GAACTCGTAC TACTACACCA TCACCACCAC GAAGAGTGAG TCGGCGAGCT GGGCCGCCGG CGGGGGCGTC AAGCTCAAGG AGAAGGCCGG TGTGTTCGTC GGCCTGCAGA TCGAGATCGA GGCATCCGTG CTGTACAACA GTTCGGTCCA GGTCAAGGAC TCGGAGTCGA GCACAGTCGT ACAGAACTTG CTCCCGGATT ACGGGGGCGC CATCTACATC GGCGATCCCT ACCTCCGCAC GTACGGCGAC TACATCGTCA ACCTGCCCAA CCTCACCATG GTCGTCACCG GCCAGTGGAT CGAGGCGGCG TCCGGGCTCG CCGCTCAGGG ACCGGTGGCC AACGTCGTCG ACTACCCGCT GAGCTAG
|
Protein sequence | MSAHARRRQT SAATKGSRRV DRQSRIEPYA WLGAGAVTLG IGAALVGGAG VAHAEDGADG GSPSTSTSSS TSESNSSTQH SDAAGDGDTD SATQQPRSDL PSSGRVEDPE PQQEPDPNGA SSSVVDSAES SASDPELPPD DGGTESEEAA SDSAPARAGT RTDNSDASTP VEGGSDGRKS AAQADDHSAE RTASPNTAAT DATSSIPPED PPVGNVGIDV AEAEADPDGA ATSSTQWLAA ATSQHEAADT MVAAIVPMPP AGVSTQTVNP SRPVNRFALA VHTLLKPIGK LLTELGGLIL GARADPVPPG ACRDGVCNPT TDIPLPWVRN EVTVLNLTGR RVTLTDLDTK LPLTYGPQEG FVLENARQVE LAFYQGNSYW DEDWTYEGTM KWSDGSTIVS VDVDAAGAIA STSDSGVQTV IFETPESIGG PSLISLRQTL VLLPEAGTVI TIDSTDPVGQ ALVASALCKV SGGCTQEVVD EQVMLSAPKQ VGNTLFNAGS VVSTNRYKVV HEVTKTSGVE ENLKVVAGTS LNFSLGPLAF QTEIGALVQQ KYGHSWSDGI TTESQVDLDV PPGSYGQIYV QYPEYHDYVN MTLTNSGVTI TIPDVEYVSL APSGALDKEG NPVAVTYSTV DWEIGTGPHP YPDSNSEPTP PETAAVGFVT PPDIAAPAPA PTAKPKSLFG IIGGYLRDQV RAFRMLPNIV FAGRAISSQT IRVVNLTPYA QTLSSITGEY EEDDSPEKGF VLQPFQEIDI QVDYNVFQDQ ETYVTWTNAT GIAASAELKV FNGGSAPRVT CQSDGCMAGS YDRDSAVMYL IYPYDTPGRM DVTNDPNLAA AAVDVACAPG YDDAMPGSCG VNVTGDTYYN APTSGPVQQQ NNFGSQTNSY YYTITTTKSE SASWAAGGGV KLKEKAGVFV GLQIEIEASV LYNSSVQVKD SESSTVVQNL LPDYGGAIYI GDPYLRTYGD YIVNLPNLTM VVTGQWIEAA SGLAAQGPVA NVVDYPLS
|
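
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The sketch below is one possible way to do that and to run basic sanity checks (GC fraction, start and stop codons). The helper names are hypothetical, and the start codon set is a simplification: translation table 11 permits additional start codons beyond those listed here.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, common start codon, stop codon at end.

    Assumption: only ATG, GTG and TTG are accepted as starts here.
    """
    starts = ("ATG", "GTG", "TTG")
    stops = ("TAA", "TAG", "TGA")
    return len(seq) % 3 == 0 and seq[:3] in starts and seq[-3:] in stops


# First two blocks of the gene sequence shown above, used only as a small demo.
demo = clean_sequence("ATGTCAGCTC ATGCGCGACG")
print(demo, len(demo))
```

Applied to the full gene sequence, `gc_fraction` can be compared with the listed GC content and `looks_like_cds` with the CDS annotation. Note that the gene sequence is shown starting with ATG although the Strand field is "-", so it appears to be given in coding orientation already.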