Gene Mvan_1172 details

Gene Information

Locus tag: Mvan_1172
Symbol:
ID: 4646561
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 1244661
End bp: 1248023
Gene Length: 3363 bp
Protein Length: 1120 aa
Translation table: 11
GC content: 68%
IMG OID: 639804670
Product: hypothetical protein
Protein accession: YP_952013
Protein GI: 120402184
COG category: [S] Function unknown
COG ID: [COG4913] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
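
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1244661
end_bp = 1248023

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A coding sequence is read in codons of three bases.
codon_count, leftover = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = codon_count - 1

print(gene_length, codon_count, leftover, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (3363 bp) and Protein Length (1120 aa).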


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.381954
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACTGAGC AATTCCACCT GTCGCGGCTC CAGGTCATCA ACTGGGGCGT GTTCGACGGC 
TACCACGACA TCCCGTTCAG CGAGGGCGGC GCACTCATCG CGGGCGCCTC GGGCAGCGGC
AAATCCTCAC TGCTGGACGC GATCTCGCTC GGCTTCCTGC CGTTCAACCG ACGCAACTTC
AACGCCTCCG GCGACAACAC CGCAGCAGGA TCCAGCGCGG GCCGCCGCAC CGTCGACAAG
TACGTGCGCG GCGCGTGGGG CCAGCGCAGC GACGGCGGCA CCAGCCGGGT GATGTACCTG
CGCGGCGACG GCACCGCCTG GTCGGCGGTG GCCGTCACCT ACGCCGGCGA CTCCGGACGC
ACCGTGACGG GCCTGGTGCT CAAGTGGCTG ACCGGTGAAT CGCGCAACGA CTCGTCGAGC
CGTTTCGTAC TCGGCGACGG CGACCTCGAC ATCGAGGACG TCTGCAACCG TTGGGCTGCA
GGACGATTCG ACACCGGCGT GTTCAAGGAA GACGGCTGGC GGTTCACCAC CAAGGTGGAG
TCGCAGTACC TGGCGCAGCT GTACGCGACC ATCGGCATCC GCGCCTCGGA TGCGGCCCAA
CAGCTGCTCG GCAAGGCTAA ATCGCTGAAA AGCGTTGGTG GACTGGAACA ATTCGTCCGC
GAGTTCATGC TCGACGAGCC CGAGAGTCTG ACCCGGCTGC CGGAGGCGCT CAAGCAGATC
GACCCGCTGG TGGAGGCGCG CGAACTGCTG GCGGTCGCGC AGAAGAAGCG CAAGATCCTC
GGCGACATCG AGAAGATCCA GCAGCGCTAC GCATCCGAGT CCACCGATCT CGGCATCATC
GACCTGGTCG ACCTGCCGAT GGTGCGCGCC TACACCGACC ATGTCCGGGT GGCGCAGTGC
CCGGCGCAGA TCGCGCAGCT CGACACCACG ATCGATCAGC TCGACAATGA GTACGAGGAC
GTCACCCGAA GCCTGAATCT GGCCAAGGCC GAAGCAGATT CGCTCAACGC GCAGATCAGC
GGGTCCAGCG CGAGCATCGG TCCGCTGCAG TCGCAGGTGA CCGCCGCCGA GACCGAGGCC
GAGCAGGTGT CGCGCCGGCG CGGCGCGTAC GAGGACATGC TCGCCGCCCA GCAGCTCGAC
GTGCCGGAGA CGGCCGACGA CTTCTGGAAC CTGCGCGAAG AACTGCTCGC CCAGGCCACC
GAACTGCTGG CCAAGGTGGA GCGCAACCGT GAGGCCTCCA CCGACGCCGA GTACGCGCAG
AAGTCGGCGC GGATGGCCCG CGACGAGGCC GCCAAGGAAC TCAAACGTGT CGAGCACGTC
GGCTCGGCGC TGCCGGAGTT CGCGCTGACC ATGCGGGAAC AGATCTGCAA TGCGGTCGGT
GTCGACTCCA CCGACCTGCC GTATGTCGCC GAACTGATGG ATCTCAAACC GGACCAGACC
CGCTGGCGCA CCGCGGTGGA GAAGGTGCTG CGCGGTGTCG GACTGCGGCT GATGGTGCCC
GATCAGCACT GGACAAAGGT GCTGCAGTTC GTCAACGAGA CGAACATGCG GGGACGGCTG
CAGCTGCACC ATGTGCGGGC GAAGTTCCTC GGCGCCGAGC CGGTCGATCC GGAGCCGAAC
ACGTTGGCGG CCAAGCTGTT CGCGGTCGAC CCGGCCCACC CGTGCGCCGC CGAGGCCGTC
GACGTGGTCA CCGCCGCCGG CGACCACGTC TGCGTCGACA CCCCCGAGGT GTTCGCCCGG
TTCCGCCGCG CGGTCACCGA CACCGGCCTG TACAAGGATT CCGACCGGCT CGCGATCAAG
GACGACCGCC GCCCACTCAA GCAGTCCGAG TACCTGTATC AGGGTGACGT GTCGGCGAAG
ATCAACGCAC TGACCGTCGA CCTGGCCGCG GCCGAGGAGG CCTATCAGAA GGCGCGGCGC
GTCGCCGACG ACATCGCCGC GCAGCGCCAG ACCTGGCGGG ACCGGGCCGC GGCGTGCAAG
GCGATCTGCG AGCAGTTCCC GCAGTGGAGC CAGATCGACA CCGAGACCGC CGACGGGCAC
GCCGACCGGC TGCGCGAGCA GTACGAGCTG CTGCTGGCCG ACCACCCCGA CATCGAGGCG
CTCAACGCCC GCGCCGACGA ATGCTGGTCG CAGATCCAGA AATTGATGAC GCGCCGGGGT
GCGATCCAGA CCCGCCGCGA CGACCTCGAC TCCCGCCGTA CGCAGCTCCT CGAACTTCAG
GAGCGGCTGC AGCCGGCATT CGTCTCGGAG CCGCTAACCG ACCTGCTGAG CCGCTACGCC
AACCAGGTGC CGGTGAGCCT GGAGCTGCTG GACCCGGAGC CGCACCGCGA TGCGTTGTTC
ACCGCGATCA AGAAGGAACG CGAACAGCTG CGCGAGAGCC GGCGCCGCTC CTACGACGAG
CTGGCCCGCA TCCTCAACAC GTTCGACACC TCGTTCCCGG ACGCGATCCC TAACGACTCG
GACAACTTCG ACGAGCGGGT GCACGACTAC GTCGCGCTGT GCCGGCACAT CGACGAGCGG
GAGCTGCCCG AGGCCTACGA GCGGATGATG CGTCTGGTCA CCGAGCAGGC GCCGGATGCG
ATCCTGACGC TGCACCGGGT GGCCGAGCAG GAAACCCGGC GGATCAGTGA CCAGATCGAC
CGTGTCAATA CGGGTTTGGG ATCGGTGGAG TTCAACCGCG GCACCCGGCT GACGCTGCGG
GCCACGCCGC GCAGCCTGAC GGCGGTGTCC GAGTTGACCG AGATCGTGCG GGCCATCTCG
CGGCGCATCG CCGAGGTCGG GCTCGGCGAC AAGCAGGCGA TCCTGGATCA GTACGCCGAC
ATCCTGCGGC TGCGTAACCG GCTGGCGTCG ACGGCGCCGG AGGACAAGGC GTGGACCCGC
GACGCGCTCG ACGTGCGCAA CCGGTTCACG TTCGACTGCG CCGAGTGGGA TGTCGCCAGC
GAGGAGCTGA TCCGCACGCA CTCCAACGCC GGCGACAACT CCGGCGGCGA GCAGGAGAAG
CTGATGGCGT TCTGCCTGGC CGGTGCGCTG AGCTTCAACC TGGCCAGCCC CGACAGCACC
GACAACCGGC CGGTGTTCGC GCAGCTGATG CTCGACGAGG CGTTCTCCAA GTCGGATCCG
CAGTTCGCGC AGCAGGCACT GCAGGCGTTC CGCAAGTTCG GGTTCCAGCT GGTGATCGTC
GCGACGGTGC AGAACGCGAC GACGATCCAG CCCTACATCG ACAGCGTGGT GATGGTGTCC
AAGACCGAGG CGACGGGCCG CAACGCACGT CCGGTGGCGA CGGTGGCGAC GCGCACGATC
TCCGAATTCG GCGAGCTGCG CCGCGAGATG CGGGCCGGCG CGAAGGTGCC CGCCCCGGCC
TGA
 
Protein sequence
MTEQFHLSRL QVINWGVFDG YHDIPFSEGG ALIAGASGSG KSSLLDAISL GFLPFNRRNF 
NASGDNTAAG SSAGRRTVDK YVRGAWGQRS DGGTSRVMYL RGDGTAWSAV AVTYAGDSGR
TVTGLVLKWL TGESRNDSSS RFVLGDGDLD IEDVCNRWAA GRFDTGVFKE DGWRFTTKVE
SQYLAQLYAT IGIRASDAAQ QLLGKAKSLK SVGGLEQFVR EFMLDEPESL TRLPEALKQI
DPLVEARELL AVAQKKRKIL GDIEKIQQRY ASESTDLGII DLVDLPMVRA YTDHVRVAQC
PAQIAQLDTT IDQLDNEYED VTRSLNLAKA EADSLNAQIS GSSASIGPLQ SQVTAAETEA
EQVSRRRGAY EDMLAAQQLD VPETADDFWN LREELLAQAT ELLAKVERNR EASTDAEYAQ
KSARMARDEA AKELKRVEHV GSALPEFALT MREQICNAVG VDSTDLPYVA ELMDLKPDQT
RWRTAVEKVL RGVGLRLMVP DQHWTKVLQF VNETNMRGRL QLHHVRAKFL GAEPVDPEPN
TLAAKLFAVD PAHPCAAEAV DVVTAAGDHV CVDTPEVFAR FRRAVTDTGL YKDSDRLAIK
DDRRPLKQSE YLYQGDVSAK INALTVDLAA AEEAYQKARR VADDIAAQRQ TWRDRAAACK
AICEQFPQWS QIDTETADGH ADRLREQYEL LLADHPDIEA LNARADECWS QIQKLMTRRG
AIQTRRDDLD SRRTQLLELQ ERLQPAFVSE PLTDLLSRYA NQVPVSLELL DPEPHRDALF
TAIKKEREQL RESRRRSYDE LARILNTFDT SFPDAIPNDS DNFDERVHDY VALCRHIDER
ELPEAYERMM RLVTEQAPDA ILTLHRVAEQ ETRRISDQID RVNTGLGSVE FNRGTRLTLR
ATPRSLTAVS ELTEIVRAIS RRIAEVGLGD KQAILDQYAD ILRLRNRLAS TAPEDKAWTR
DALDVRNRFT FDCAEWDVAS EELIRTHSNA GDNSGGEQEK LMAFCLAGAL SFNLASPDST
DNRPVFAQLM LDEAFSKSDP QFAQQALQAF RKFGFQLVIV ATVQNATTIQ PYIDSVVMVS
KTEATGRNAR PVATVATRTI SEFGELRREM RAGAKVPAPA
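
The gene sequence begins with TTG while the protein sequence begins with M, which is consistent with translation table 11, where TTG can act as an initiation codon. The following is a minimal sketch of that relationship, not the method used to produce this record. The function name `translate_cds` is hypothetical, the codon table is the standard codon-to-amino-acid mapping (which table 11 shares for elongation), and `START_CODONS` is an assumed subset of the alternative start codons allowed by table 11.

```python
# Standard codon-to-amino-acid mapping, built in TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a subset of the initiation codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading an allowed start codon as M."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")         # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":           # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases of the gene sequence listed above.
first_60 = "TTGACTGAGC AATTCCACCT GTCGCGGCTC CAGGTCATCA ACTGGGGCGT GTTCGACGGC"
print(translate_cds(first_60))
```

For this fragment the result matches the first 20 residues of the protein sequence, MTEQFHLSRL QVINWGVFDG.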