Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_5774 |
Symbol | |
ID | 4643731 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 6165993 |
End bp | 6167543 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639809250 |
Product | HNH endonuclease |
Protein accession | YP_956545 |
Protein GI | 120406716 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.300125 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0760433 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGTCGAC CCCGAAGCCG CCGTGCTCGC CGAACTGGCC GCCGCCCAGA ACATCACCGC AGGGCTGACC TCAAGCCAAA CCCACCGCGG CGTCGCGCTG CGGGATCGGC TGCCCAACGT GTTCGCCCTG TTCCTCGCCG GCCTGATCAG CGACCTGCTG GCCCGCACCA TCGTCTGGCG CACCGCCCTG ATCACCGACC CCGCCCTGAT GGCCGCCGTC GACGCCGAAC TCGCCGCCCA GATCACCACC TGGGGCCAGC TGTCGGCCGC CAGAACCGAA CTGGCCATCG ACGCCCTCGT CGACCGCCAC GACCCCGACG GCGTCCGCTC CACCAAGAAC TCGCAGGTGT GCCGCACCCT GGAATTCGGC ATCCCCGTCG ACAGCCCCGG ACTCACCACC ATCTGGGCCC GCATCTTCGC GGCGACGCCG CAGCCGCCCA ACGCCGCATC GACGACATGG CCCACAGCGT CTGTCCCGAC GACCCCCGCA CCCTCGACGA CCGCCGCATC GAGGCCTACA CCGCCCTACT CGCCGGCATC ACCACCCTGA CCTGCCACTG CGGAAACGAC GACTGCGAAG CCACCGCCGC GCCCCGACCC GGCCGCGACA CCACCATCTA CGTCCTCACC GACACCACCA CCGGCAACAC TCCTGCTGCC GACCAGGCCG ACCAGAACAC CCGCGATCGC AACGAGGCTG AACCGGCCAC CGACACCACT GGCAAGGCCA AGAACGAGAA CGAGGCTGCC CGTGAGGGCG AACGCGAGGA CGCGGCTGCC TCCGAGGACC GCGACGGGAC CGAGCAGACG CCTGCGGCAA AGCAGAACGT CCGGCCGCAC ACCGCGCAGT GCCGGTCGGC CTACGTGTTC GGCGCCGGCC TGGCCCCCAC CGCGCTGCTG GAGGCCATGT GTGAGGGCGC CACGATCCGC GAGATCACCC ACCCCGGCCC CGACTCAGCC CCCGAACCGC GCTACACCCC CTCACCGGCG CTGGCCGCGT ACATTCGCTG CCGCGACCTG ACGTGCCGCT TCCCGCACTG CGACACACCC GCCACCCTCG CCGACATCGA CCACACCGTG CCCTACCCGG TCGGACCCAC CCACCCGTCC AACCTCAAAA CCCTGTGCCG TTTTCATCAC CTTCTGAAAA CGTTCTGGCT CGGCGCCACC GGCTGGCGCG ACCGCCAATA CCCCGACGGC ACCATCGAAT GGACCGCACC CACCGGCCAC ACCTACACCA CCTACCCCGG CAGCCGACTG CTCTTCCCCG CCCTGTGTGC ACCCACCGCC ACCCTCTGGA CCGGCGAACC ACCCCAAACC ACCCTCAGCG CGCGGCGCGG GGCCATGATG CCGAAACGGC GAAACACCCG CGCCCACAAC CGCTCCCGCT ACATCGAAGC CCAACGACGA CGCAATCGAT CCGAGAAGAT CTGCACCACA CGATCAACGG ATATCGCCAG AGGACGCGAC ATCCTCTACC GCAACACTCT CCACCAATTC CACCCGCCAG GGCACGAACC CGACTACGGG AACGACCCAC CACCCTTCTA G
|
Protein sequence | MGRPRSRRAR RTGRRPEHHR RADLKPNPPR RRAAGSAAQR VRPVPRRPDQ RPAGPHHRLA HRPDHRPRPD GRRRRRTRRP DHHLGPAVGR QNRTGHRRPR RPPRPRRRPL HQELAGVPHP GIRHPRRQPR THHHLGPHLR GDAAAAQRRI DDMAHSVCPD DPRTLDDRRI EAYTALLAGI TTLTCHCGND DCEATAAPRP GRDTTIYVLT DTTTGNTPAA DQADQNTRDR NEAEPATDTT GKAKNENEAA REGEREDAAA SEDRDGTEQT PAAKQNVRPH TAQCRSAYVF GAGLAPTALL EAMCEGATIR EITHPGPDSA PEPRYTPSPA LAAYIRCRDL TCRFPHCDTP ATLADIDHTV PYPVGPTHPS NLKTLCRFHH LLKTFWLGAT GWRDRQYPDG TIEWTAPTGH TYTTYPGSRL LFPALCAPTA TLWTGEPPQT TLSARRGAMM PKRRNTRAHN RSRYIEAQRR RNRSEKICTT RSTDIARGRD ILYRNTLHQF HPPGHEPDYG NDPPPF
|
| |