Gene Mvan_5774 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5774 
Symbol 
ID4643731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp6165993 
End bp6167543 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content69% 
IMG OID639809250 
ProductHNH endonuclease 
Protein accessionYP_956545 
Protein GI120406716 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.300125 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0760433 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTCGAC CCCGAAGCCG CCGTGCTCGC CGAACTGGCC GCCGCCCAGA ACATCACCGC 
AGGGCTGACC TCAAGCCAAA CCCACCGCGG CGTCGCGCTG CGGGATCGGC TGCCCAACGT
GTTCGCCCTG TTCCTCGCCG GCCTGATCAG CGACCTGCTG GCCCGCACCA TCGTCTGGCG
CACCGCCCTG ATCACCGACC CCGCCCTGAT GGCCGCCGTC GACGCCGAAC TCGCCGCCCA
GATCACCACC TGGGGCCAGC TGTCGGCCGC CAGAACCGAA CTGGCCATCG ACGCCCTCGT
CGACCGCCAC GACCCCGACG GCGTCCGCTC CACCAAGAAC TCGCAGGTGT GCCGCACCCT
GGAATTCGGC ATCCCCGTCG ACAGCCCCGG ACTCACCACC ATCTGGGCCC GCATCTTCGC
GGCGACGCCG CAGCCGCCCA ACGCCGCATC GACGACATGG CCCACAGCGT CTGTCCCGAC
GACCCCCGCA CCCTCGACGA CCGCCGCATC GAGGCCTACA CCGCCCTACT CGCCGGCATC
ACCACCCTGA CCTGCCACTG CGGAAACGAC GACTGCGAAG CCACCGCCGC GCCCCGACCC
GGCCGCGACA CCACCATCTA CGTCCTCACC GACACCACCA CCGGCAACAC TCCTGCTGCC
GACCAGGCCG ACCAGAACAC CCGCGATCGC AACGAGGCTG AACCGGCCAC CGACACCACT
GGCAAGGCCA AGAACGAGAA CGAGGCTGCC CGTGAGGGCG AACGCGAGGA CGCGGCTGCC
TCCGAGGACC GCGACGGGAC CGAGCAGACG CCTGCGGCAA AGCAGAACGT CCGGCCGCAC
ACCGCGCAGT GCCGGTCGGC CTACGTGTTC GGCGCCGGCC TGGCCCCCAC CGCGCTGCTG
GAGGCCATGT GTGAGGGCGC CACGATCCGC GAGATCACCC ACCCCGGCCC CGACTCAGCC
CCCGAACCGC GCTACACCCC CTCACCGGCG CTGGCCGCGT ACATTCGCTG CCGCGACCTG
ACGTGCCGCT TCCCGCACTG CGACACACCC GCCACCCTCG CCGACATCGA CCACACCGTG
CCCTACCCGG TCGGACCCAC CCACCCGTCC AACCTCAAAA CCCTGTGCCG TTTTCATCAC
CTTCTGAAAA CGTTCTGGCT CGGCGCCACC GGCTGGCGCG ACCGCCAATA CCCCGACGGC
ACCATCGAAT GGACCGCACC CACCGGCCAC ACCTACACCA CCTACCCCGG CAGCCGACTG
CTCTTCCCCG CCCTGTGTGC ACCCACCGCC ACCCTCTGGA CCGGCGAACC ACCCCAAACC
ACCCTCAGCG CGCGGCGCGG GGCCATGATG CCGAAACGGC GAAACACCCG CGCCCACAAC
CGCTCCCGCT ACATCGAAGC CCAACGACGA CGCAATCGAT CCGAGAAGAT CTGCACCACA
CGATCAACGG ATATCGCCAG AGGACGCGAC ATCCTCTACC GCAACACTCT CCACCAATTC
CACCCGCCAG GGCACGAACC CGACTACGGG AACGACCCAC CACCCTTCTA G
 
Protein sequence
MGRPRSRRAR RTGRRPEHHR RADLKPNPPR RRAAGSAAQR VRPVPRRPDQ RPAGPHHRLA 
HRPDHRPRPD GRRRRRTRRP DHHLGPAVGR QNRTGHRRPR RPPRPRRRPL HQELAGVPHP
GIRHPRRQPR THHHLGPHLR GDAAAAQRRI DDMAHSVCPD DPRTLDDRRI EAYTALLAGI
TTLTCHCGND DCEATAAPRP GRDTTIYVLT DTTTGNTPAA DQADQNTRDR NEAEPATDTT
GKAKNENEAA REGEREDAAA SEDRDGTEQT PAAKQNVRPH TAQCRSAYVF GAGLAPTALL
EAMCEGATIR EITHPGPDSA PEPRYTPSPA LAAYIRCRDL TCRFPHCDTP ATLADIDHTV
PYPVGPTHPS NLKTLCRFHH LLKTFWLGAT GWRDRQYPDG TIEWTAPTGH TYTTYPGSRL
LFPALCAPTA TLWTGEPPQT TLSARRGAMM PKRRNTRAHN RSRYIEAQRR RNRSEKICTT
RSTDIARGRD ILYRNTLHQF HPPGHEPDYG NDPPPF