Gene Mvan_1133 details

Gene Information

Locus tag: Mvan_1133
Symbol:
ID: 4648568
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 1202906
End bp: 1206277
Gene Length: 3372 bp
Protein Length: 1123 aa
Translation table: 11
GC content: 65%
IMG OID: 639804632
Product: hypothetical protein
Protein accession: YP_951975
Protein GI: 120402146
COG category: [S] Function unknown
COG ID: [COG4913] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
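
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is an untranslated stop codon. The variable names are hypothetical; the values are copied from the table.

```python
# Values copied from the Gene Information table.
start_bp = 1202906
end_bp = 1206277
gene_length_bp = 3372
protein_length_aa = 1123

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein length.
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(remainder == 0)                                # True
print(expected_protein_length == protein_length_aa)  # True
```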


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.802022
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGAAT CACGCTGCGA TGCAACCCAA CCCGGGGCCG GGTACCGACT CCAGTACGCC 
GAGGTCTACA ACTGGGGAAC GTTCGACGAC CATTCCTGGC GGTTCACCCC CGGCACCGAC
ACCGCGCTGC TGACCGGCGA CATCGGATCA GGGAAATCGA CAATCGTCGA CGCGCTCACC
ACATTGCTGG TGCCCGCGCA CAAGGCCGCC TACAACAAGG CCGCCGGCGC CGACGCCAAG
GAGCGCACGC TGCGCTCCTA TGTCGAGGGC CACTACAAAT CCGAGCGCAA CGAATCCACC
GGTAGATCCC GTCCGAAAGG CCTACGAGAG AACAAGCGCA CCTACTCGGT GATTCTCGGT
GTGTTCCGCA ACCACGGTCA CGACGAGACG GTCACCCTGG CTCAGGTGTT CCAGCAGCGC
GAGAGCACCG GGCAGCCCTA CCGGTTCTTC GTGACCGCCA CCAAGGAGCT GTCCATCGCG
ACCGACTTCG CCGACTTCGG CACCGATCTG CGCGAACTGC GCAGACGGCT GCGTGGCGCC
GGGGCCGAGA TCTTCGATGA GTTCCCGAAG TACTCCACCT CGTTGCGCCG CCTGCTCGGT
ATCCGCTCGG AGCAGGCCCT CGAACTGTTC CACCAGACGG TGTCGATGAA GTCGGTCGGG
AATCTCAACG ATTTCGTCCG TGATCACATG CTCGAACCCA GCGACTCGAC CGAACGCGTC
CGCGAGATCA TCGGACATTT CGAAGATTTG ACCAAAGCGC ACGACGCCGT CAAACGCGCC
CGCGAGCAAC TCGAAGCGCT GCAACCGATC GTGGACACCG CAGCGAAATA CGACGCGGCA
CTAGCCGAAC GTGCGGGTTT GGAGCTCGAG CGCGCGGCGG TCCGGCTGTT CATCGCCGAG
CTGCGCTCAG GGTTGCTCAC CGACGAGATC GCGCGGCTGG AAGCCGACGG AGCGGCGTTG
CTGACGCAAC TGGACACCGC CGAAGCCGAA CAGCGCAGGC TCGGCCGCGA GCGCGACTCG
CTGATCGAAG AGCGCGCCAA AGCGGGCGGT GACCGCATCG GCGAGCTGGA GCGACTTGCC
GCTGACGCCC GCGAGCAGGC GAAAACGCGA AGCCAGACAA AGGCTTTGTT CGACGCCGCA
GTGGCAGAGG CCGGACTGGA GCCCGTCGCC GACGGTGACG CCTTCGCCGC GCTCGGCGCC
GTGGTCGCCG GCGAACGCCC CCGGCTGACC GGTGAGAAAC GCGACCTCGA CACCGCGACC
GTTGACGCGA TCGGTCGCGA GCGGGAGCAC CAACGCAGGT GCGATGTGAT CGCCGAGGAG
GTCGCGAGCC TGGAGCAGCG TACCGACAAC CTGCCACAGG AGCAGGTGGT GGTGCGGGCC
GAGTTGTGCG CGGCGCTGGG GTTGACGCCG GACGACCTGC CATATGCCGG CGAGTTGCTC
GATGTCCATG ACGAGCACGC GCAGTGGCGG GGCGCCGCAG AACGTGTGCT GCGGGGGTTC
GCGCTGTCAT TGCTGGTGCC GCAGCGGCAC TACGACGCCG TCACCGCATG GGTGAACGGG
CGCAGGCTCA CCGTCGGCGG CCGCGGCGCC AAACTGGTCT ACGAACGCGT CCCCCAACAT
CGGGTACGAC TGCAGCCGAC GGCACACGAC GGCTTGTTGC TGGCCGACTG CATCGAAGTC
CGAGACGGGC AGTTCGAGGA ATACCTTCGC GCCGAGCTGA TGAAGCGCGC CGACTTCCGC
TGCGCAGCAA CACTTGACGA GTTCCGCACC GAGCGTCGCG CCGTCACCCG AGAGGGCCAG
GTGCGCTCGG GGGACCGGCA CGAGAAGGAC GACCGGCACC GGGTCGACGA CCCCAGACGG
TGGGTACTGG GCTGGGTCAA CGAACGCAAG ATCGCCGCGA TGCGCGCGGA ACTGGCCGAG
CTGGAAAGCC AACGTGACGA GGCCGCGGCA CAAGCCGCGC GGCTGGTAAA AGAACGCGAC
GCCCTGCAGT GTCGACTGGA TGCGTTCCGG AGTGTCGAGG GGTTCCGCTC CTGGGGCGAA
CTCGACGCCG ACGAGGCCGA GTCGCGTGCG AAAGCGCATG ACGCCGAGAG GGTTCGGCTT
CAGGCCGGGT CGAATCGACT GGCGGAGATC ACGCAGGCGC TGGAGCGCAA CGCCGAGAAC
GCGGTCACAG TAACGGATCT GATCAAGAAG CTCACCGGCA TGCTGGCGAC CGCGCAATCG
AGAATGAATC AGGCCAAGCA GGAGCGAAGC CGCGACGACG AGTTCGTCGC TGCGCATGCA
CCGGATCAAC GGGAGAAGGC CCGTGCGTCG TACCCGGCCC TGACCGCGCG ACTGGCGGAT
AGCCCGCCCG CCCGTGCGGC GGACTGCGCC GACTCCGAGG CGGCATTGTC CGATGACCTG
CACCGGCGCA TCGAGCGGTT GTCCGGTCAG CTCAACGGGC ACGCATTGAA TCTGACACAG
CACATGACGG CAGTCCTGAA CCGATGGCAA GAGCTGCGGG CGGACATGGA CGTCAATGTC
GAATCCCGAG CAGACTTCCT GGCCTTCCGG GAGCGGGTCG CCACCGACGA CCTGCCCCGG
TTCGAGAGCG AGTTCAAAGA ACAGCTCAAC AAGAACGCCA TCCAAGAGTT GGCGGGGTTC
AATAACTGGT TGGGCCGGCA GGCGTCGGCC ATCGACGAGC GCGTTGACCG AATCAACGAC
GCCCTCGGTG CGGTGCCCTA CAACCCGGGC CGATACATCA AGCTGGAAAA GGAACCGACC
AGCAATCAGG ATGTCGCGCA GTTCCGCTCC GATCTCCGCA ATCTCACCAA CGACACGCTC
ACCGCGGACG GCGACCAGTA TTCCGAGCAG CGCTTTCTGG ACGTGAAGCG GATCATCGAG
CGCTTCCGGG GCCGCGACGG CTACGCCGAA TCGGACAAGA ACTGGACGCG TCGCGTTACC
GATGTGCGTA ACTGGTTCGT GTTTTCGGCA TCCGAGCGTG ATGTCGACAC CGATCTCGAG
TGGGAGCACT ACAGCGACTC CGACGGCAAG TCCGGCGGGC AGAAAGAGAA GCTGGCCTAC
ACCATTCTCG CGGCATCGCT GGCCTACCAG TTCGGTCTGG AGTGGGGCGT CGAGCGATCA
CGCGACTTCC GGTTCGCGGT CATCGACGAG GCTTTCGGGC GGGGTTCCGA TGTCTCCACA
CGGTATGCCC TCGATCTATT CGCGACTCTC GGTCTGCAGC TGCTGATCGT GACACCCTTG
CAGAAGGTGC ACGTCATCGA GCCCTATGTG AAATCGATTG GCATCGTCGA CAATCCGACC
GGAACCTATT CGCGGTTGCA GACGATGACC ATCGAGGAGT ACCGGGACCG GCGAGACCGG
CCACTACGGT GA
 
Protein sequence
MTESRCDATQ PGAGYRLQYA EVYNWGTFDD HSWRFTPGTD TALLTGDIGS GKSTIVDALT 
TLLVPAHKAA YNKAAGADAK ERTLRSYVEG HYKSERNEST GRSRPKGLRE NKRTYSVILG
VFRNHGHDET VTLAQVFQQR ESTGQPYRFF VTATKELSIA TDFADFGTDL RELRRRLRGA
GAEIFDEFPK YSTSLRRLLG IRSEQALELF HQTVSMKSVG NLNDFVRDHM LEPSDSTERV
REIIGHFEDL TKAHDAVKRA REQLEALQPI VDTAAKYDAA LAERAGLELE RAAVRLFIAE
LRSGLLTDEI ARLEADGAAL LTQLDTAEAE QRRLGRERDS LIEERAKAGG DRIGELERLA
ADAREQAKTR SQTKALFDAA VAEAGLEPVA DGDAFAALGA VVAGERPRLT GEKRDLDTAT
VDAIGREREH QRRCDVIAEE VASLEQRTDN LPQEQVVVRA ELCAALGLTP DDLPYAGELL
DVHDEHAQWR GAAERVLRGF ALSLLVPQRH YDAVTAWVNG RRLTVGGRGA KLVYERVPQH
RVRLQPTAHD GLLLADCIEV RDGQFEEYLR AELMKRADFR CAATLDEFRT ERRAVTREGQ
VRSGDRHEKD DRHRVDDPRR WVLGWVNERK IAAMRAELAE LESQRDEAAA QAARLVKERD
ALQCRLDAFR SVEGFRSWGE LDADEAESRA KAHDAERVRL QAGSNRLAEI TQALERNAEN
AVTVTDLIKK LTGMLATAQS RMNQAKQERS RDDEFVAAHA PDQREKARAS YPALTARLAD
SPPARAADCA DSEAALSDDL HRRIERLSGQ LNGHALNLTQ HMTAVLNRWQ ELRADMDVNV
ESRADFLAFR ERVATDDLPR FESEFKEQLN KNAIQELAGF NNWLGRQASA IDERVDRIND
ALGAVPYNPG RYIKLEKEPT SNQDVAQFRS DLRNLTNDTL TADGDQYSEQ RFLDVKRIIE
RFRGRDGYAE SDKNWTRRVT DVRNWFVFSA SERDVDTDLE WEHYSDSDGK SGGQKEKLAY
TILAASLAYQ FGLEWGVERS RDFRFAVIDE AFGRGSDVST RYALDLFATL GLQLLIVTPL
QKVHVIEPYV KSIGIVDNPT GTYSRLQTMT IEEYRDRRDR PLR
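
As a spot check, the two ends of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the database's own method: `translate` and `gc_percent` are hypothetical helpers, and the codon table is the standard one, which is assumed to be sufficient here because translation table 11 assigns the same amino acids to sense codons and differs mainly in permitted start codons. Only the first 30 and last 12 bases are copied from the gene sequence above; the full sequence can be passed to the same functions.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used in the listing and upper-case the bases."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1; stop codons are shown as '*'. Trailing partial codons are ignored."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 12 bases of the gene sequence listed above.
first_30 = "ATGACTGAAT CACGCTGCGA TGCAACCCAA"
last_12 = "CCACTACGGT GA"

print(translate(first_30))  # MTESRCDATQ
print(translate(last_12))   # PLR*
```

The translated ends agree with the first ten residues (MTESRCDATQ) and the last three residues (PLR) of the protein sequence, with the final TGA read as the stop codon.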