Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_3094 |
Symbol | |
ID | 4646850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 3261533 |
End bp | 3262888 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639806571 |
Product | hypothetical protein |
Protein accession | YP_953902 |
Protein GI | 120404073 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.292492 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.43311 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGCA ATTTGTCGTT GCTGGCGTTG TTGGCGTTCA TGCTGCTCAC CGCGGGAACC GCGGTGTTCG TCGCCGCCGA GTTCTCCCTG ACCGCGCTCG AACGCAGCAC CGTCGAGGCC AACGTCCGCT CAGGGGACCG CCGCGATGTG ATGGTGCAGC GTGCCCACCG CACGCTGTCG ACGCAGTTGT CCGGAGCGCA GGTCGGCATC TCCATCACCA CGCTGGCCAC CGGTTTCCTG GCCGAACCCG TCGTCGCCCG GCTGATCCAC CCCGGCCTGA CCGCCATCGG GATCCCGGAC CGGTTCGTCG GCGGTCTGGC GTTGACCCTC GCCATCCTGA TCGCCACCTC GATCTCGATG GTGTTCGGCG AGCTGGTGCC CAAGAACCTC GCGGTGGCCC GCCCGGTGCC GACCGCGCGG TGGTCGGCAC CGCTGCAGCT GATGTTCTCG TTCCTGTTCA CGCCCCTGAT CCGGCTGACC AACGGCACCG CGAACTGGAT CCTGCGGCGG CTCGGCATCG AACCGGCCGA GGAACTGCGC TCGGCACGCT CACCCCAGGA GCTGGTGTCG CTGGTGCGGT CCTCGGCCGA ACGCGGATCG CTGGACCCGG TCACGGCGCT GCTGGTGGAC CGCTCCCTGC AGTTCGGCGA CCGCTCCGCC GAAGAGCTGA TGACGCCGCG GTCCAAGATC GACACGCTGG AGGCCGACGA CACGGTCGCC GACCTCAGCG ACGCCGCGAC CCGAACGGGC CACTCCCGCT TCCCCGTCAT CCGCGGTGAC CTCGACGAAA CCGTCGGCAT GGTGCACGTC AAACAGGTGT TCGCCGTGCC GGCCGACGCC CGCGCGACAA CCAGGCTGGC CACCCTGGTC CAGCCCGTCA CCAAGGTGCC TTCGACGCTC GACGGGGATG CGGTGATGTC GGAGGTGCGC GCCAACGGTC TGCAGACCGC GTTGGTGGTC GACGAATACG GCGGCACCGC GGGCATGGTG ACGGTCGAGG ATCTGATCGA GGAGATCGTC GGCGATGTGC GCGACGAACA CGACGTCGAA CCGCCCGACG TGGTGCAGGC CGGCCGTGGC TGGCAGGTCT CCGGTCTGCT GCGCATCGAC GAGGTGGCTC AGGGCACCGA GTTCCGGGCA CCTGAAGGCG ACTACGAAAC CATCGGCGGT CTGGTGCTGG AGAAGCTCGG CCACATACCG GAGGAAGGCG AGTCGGTGGA GCTGATCGCC TTCGACCCGG ACGGCCCGAT CCAGGATCCG GTGCACTGGC TGGCGACCGT GGTCAAGATG GACGGCCGCC GCATCGACCA GCTGCGGCTG ACCGAACTCG GCCGCAAGGG AGACAGCCGT GGGTGA
|
Protein sequence | MSSNLSLLAL LAFMLLTAGT AVFVAAEFSL TALERSTVEA NVRSGDRRDV MVQRAHRTLS TQLSGAQVGI SITTLATGFL AEPVVARLIH PGLTAIGIPD RFVGGLALTL AILIATSISM VFGELVPKNL AVARPVPTAR WSAPLQLMFS FLFTPLIRLT NGTANWILRR LGIEPAEELR SARSPQELVS LVRSSAERGS LDPVTALLVD RSLQFGDRSA EELMTPRSKI DTLEADDTVA DLSDAATRTG HSRFPVIRGD LDETVGMVHV KQVFAVPADA RATTRLATLV QPVTKVPSTL DGDAVMSEVR ANGLQTALVV DEYGGTAGMV TVEDLIEEIV GDVRDEHDVE PPDVVQAGRG WQVSGLLRID EVAQGTEFRA PEGDYETIGG LVLEKLGHIP EEGESVELIA FDPDGPIQDP VHWLATVVKM DGRRIDQLRL TELGRKGDSR G
|
| |