Gene Mvan_2224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_2224 
Symbol 
ID4644946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp2373747 
End bp2376743 
Gene Length2997 bp 
Protein Length998 aa 
Translation table11 
GC content65% 
IMG OID639805708 
Producthypothetical protein 
Protein accessionYP_953044 
Protein GI120403215 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.134781 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGCTC ATGCGCGACG CCGGCAGACC AGTGCAGCGA CGAAGGGGTC GCGTCGGGTG 
GACCGACAGT CACGGATCGA GCCCTATGCC TGGCTGGGAG CGGGTGCGGT CACGCTGGGC
ATCGGGGCGG CTTTGGTCGG CGGAGCCGGT GTCGCGCACG CCGAGGACGG AGCGGACGGC
GGCTCGCCGT CGACGTCCAC CTCGAGCTCG ACCTCCGAAA GCAACTCGAG CACTCAGCAC
TCGGACGCCG CGGGCGATGG TGACACCGAC AGTGCTACGC AGCAGCCGAG GAGTGACCTG
CCGTCGTCGG GCCGCGTGGA AGACCCAGAG CCGCAACAAG AGCCTGATCC GAACGGTGCA
TCGAGCTCGG TGGTGGACTC CGCGGAATCG AGCGCGTCTG ACCCGGAACT CCCACCCGAT
GATGGCGGGA CGGAATCCGA AGAGGCAGCG TCCGATTCGG CACCGGCACG TGCCGGCACG
CGCACCGACA ATTCAGATGC TTCGACTCCG GTCGAGGGAG GCAGCGACGG CCGGAAGTCC
GCTGCGCAAG CCGACGACCA CAGTGCGGAG CGGACCGCGT CACCAAACAC GGCGGCGACA
GACGCGACGT CGAGCATTCC GCCGGAGGAC CCCCCGGTCG GCAACGTGGG CATCGACGTC
GCGGAAGCAG AAGCTGACCC CGACGGCGCC GCGACGTCCT CGACGCAATG GCTTGCGGCG
GCCACCTCGC AACACGAAGC GGCCGACACC ATGGTTGCGG CCATCGTTCC CATGCCCCCA
GCCGGTGTCT CGACACAGAC GGTGAATCCG AGTCGCCCGG TCAACCGCTT CGCGCTGGCG
GTCCACACTC TTCTGAAACC CATCGGGAAG CTTCTGACCG AGCTGGGGGG ACTCATCCTG
GGCGCCCGCG CCGACCCGGT CCCTCCCGGC GCATGCCGGG ACGGCGTCTG CAATCCCACC
ACTGACATCC CACTTCCCTG GGTTCGCAAC GAAGTCACCG TCCTGAACCT GACCGGCAGG
CGGGTCACCC TGACCGACCT CGACACCAAG CTTCCGCTGA CCTACGGCCC GCAGGAAGGG
TTCGTCCTCG AGAATGCGCG CCAGGTCGAG CTGGCGTTCT ACCAGGGCAA CAGTTACTGG
GACGAAGACT GGACCTACGA GGGCACGATG AAGTGGTCGG ACGGATCCAC GATCGTCAGC
GTGGACGTCG ACGCAGCCGG CGCCATTGCG TCGACCTCCG ACAGCGGCGT GCAGACCGTC
ATCTTCGAGA CTCCGGAATC CATCGGCGGC CCGTCGCTCA TCTCACTCCG GCAGACGTTG
GTGCTCCTGC CGGAGGCCGG CACCGTGATC ACCATCGACA GCACGGACCC AGTCGGGCAA
GCGCTCGTCG CCTCGGCCCT GTGCAAGGTG TCGGGCGGCT GCACTCAGGA AGTCGTCGAC
GAGCAGGTCA TGCTCAGCGC GCCGAAGCAG GTCGGCAATA CACTGTTCAA CGCCGGAAGC
GTCGTCTCGA CCAACAGGTA CAAGGTCGTC CACGAGGTGA CCAAGACCAG CGGTGTCGAG
GAAAACCTGA AGGTGGTCGC CGGCACCTCA TTGAATTTCT CCCTCGGGCC GCTCGCCTTC
CAGACCGAAA TCGGCGCGCT GGTGCAACAG AAGTACGGGC ACAGCTGGAG CGACGGCATC
ACCACCGAGT CCCAAGTGGA TCTGGATGTG CCGCCCGGAA GCTACGGCCA GATCTACGTG
CAGTACCCCG AATATCACGA CTACGTCAAC ATGACGTTGA CCAATTCGGG TGTCACCATC
ACGATTCCCG ACGTCGAGTA CGTCAGCCTG GCACCCTCGG GCGCCCTCGA CAAGGAGGGG
AACCCGGTCG CGGTCACCTA CTCCACTGTC GACTGGGAGA TCGGGACCGG CCCACACCCC
TACCCCGATT CCAATTCAGA GCCGACGCCT CCGGAGACCG CTGCCGTCGG CTTTGTGACC
CCGCCCGACA TCGCGGCGCC CGCCCCGGCG CCCACTGCCA AACCGAAGTC CCTGTTCGGC
ATCATCGGCG GCTATCTGCG GGATCAGGTT CGGGCGTTCC GCATGCTGCC GAACATCGTG
TTCGCCGGCC GAGCCATCAG CTCACAGACG ATCCGGGTCG TCAACCTCAC GCCCTACGCG
CAGACGCTCA GCAGCATCAC CGGCGAATAC GAGGAAGACG ACTCCCCGGA GAAGGGCTTT
GTCCTGCAAC CCTTCCAGGA GATCGACATC CAGGTGGATT ACAACGTCTT CCAGGACCAA
GAGACGTACG TGACGTGGAC CAACGCGACG GGTATCGCCG CCTCGGCGGA GCTGAAGGTT
TTCAACGGGG GCAGTGCACC CCGGGTCACC TGCCAGAGCG ACGGTTGCAT GGCGGGCAGC
TACGACCGCG ACAGCGCCGT CATGTACCTG ATCTACCCGT ATGACACACC GGGCCGAATG
GACGTCACGA ACGACCCGAA CCTCGCGGCT GCAGCGGTCG ACGTGGCGTG CGCGCCGGGC
TATGACGACG CGATGCCCGG CTCGTGCGGG GTGAACGTCA CCGGCGACAC CTACTACAAC
GCGCCGACAT CGGGACCCGT CCAGCAGCAG AACAACTTTG GCTCCCAGAC GAACTCGTAC
TACTACACCA TCACCACCAC GAAGAGTGAG TCGGCGAGCT GGGCCGCCGG CGGGGGCGTC
AAGCTCAAGG AGAAGGCCGG TGTGTTCGTC GGCCTGCAGA TCGAGATCGA GGCATCCGTG
CTGTACAACA GTTCGGTCCA GGTCAAGGAC TCGGAGTCGA GCACAGTCGT ACAGAACTTG
CTCCCGGATT ACGGGGGCGC CATCTACATC GGCGATCCCT ACCTCCGCAC GTACGGCGAC
TACATCGTCA ACCTGCCCAA CCTCACCATG GTCGTCACCG GCCAGTGGAT CGAGGCGGCG
TCCGGGCTCG CCGCTCAGGG ACCGGTGGCC AACGTCGTCG ACTACCCGCT GAGCTAG
 
Protein sequence
MSAHARRRQT SAATKGSRRV DRQSRIEPYA WLGAGAVTLG IGAALVGGAG VAHAEDGADG 
GSPSTSTSSS TSESNSSTQH SDAAGDGDTD SATQQPRSDL PSSGRVEDPE PQQEPDPNGA
SSSVVDSAES SASDPELPPD DGGTESEEAA SDSAPARAGT RTDNSDASTP VEGGSDGRKS
AAQADDHSAE RTASPNTAAT DATSSIPPED PPVGNVGIDV AEAEADPDGA ATSSTQWLAA
ATSQHEAADT MVAAIVPMPP AGVSTQTVNP SRPVNRFALA VHTLLKPIGK LLTELGGLIL
GARADPVPPG ACRDGVCNPT TDIPLPWVRN EVTVLNLTGR RVTLTDLDTK LPLTYGPQEG
FVLENARQVE LAFYQGNSYW DEDWTYEGTM KWSDGSTIVS VDVDAAGAIA STSDSGVQTV
IFETPESIGG PSLISLRQTL VLLPEAGTVI TIDSTDPVGQ ALVASALCKV SGGCTQEVVD
EQVMLSAPKQ VGNTLFNAGS VVSTNRYKVV HEVTKTSGVE ENLKVVAGTS LNFSLGPLAF
QTEIGALVQQ KYGHSWSDGI TTESQVDLDV PPGSYGQIYV QYPEYHDYVN MTLTNSGVTI
TIPDVEYVSL APSGALDKEG NPVAVTYSTV DWEIGTGPHP YPDSNSEPTP PETAAVGFVT
PPDIAAPAPA PTAKPKSLFG IIGGYLRDQV RAFRMLPNIV FAGRAISSQT IRVVNLTPYA
QTLSSITGEY EEDDSPEKGF VLQPFQEIDI QVDYNVFQDQ ETYVTWTNAT GIAASAELKV
FNGGSAPRVT CQSDGCMAGS YDRDSAVMYL IYPYDTPGRM DVTNDPNLAA AAVDVACAPG
YDDAMPGSCG VNVTGDTYYN APTSGPVQQQ NNFGSQTNSY YYTITTTKSE SASWAAGGGV
KLKEKAGVFV GLQIEIEASV LYNSSVQVKD SESSTVVQNL LPDYGGAIYI GDPYLRTYGD
YIVNLPNLTM VVTGQWIEAA SGLAAQGPVA NVVDYPLS