Gene Mvan_1814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1814 
Symbol 
ID4644070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1926410 
End bp1929418 
Gene Length3009 bp 
Protein Length1002 aa 
Translation table11 
GC content67% 
IMG OID639805302 
Producthypothetical protein 
Protein accessionYP_952642 
Protein GI120402813 
COG category[S] Function unknown 
COG ID[COG1615] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.693843 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0180395 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCCCG CGGCGAGAAT GCCGAATCTG ACGCGTCGAA GCCGGGTAAT GATCGCCGTC 
GCCCTGGCCG TCGTGGTGCT GTTGCTGTTG GGCCCGCGGC TCGTCGACAC CTACGTCAAC
TGGTTGTGGT TCGGGGAGCT CGGCTACCGA TCGGTGTTCA CCACCCAGAT CGTGACCCGG
TTGCTCCTGT TCCTGGCGGT GGCCGTCGTC TTCGGTGCCG TCGTGTTCGC CGCAATGGCG
TTGGCCTACC GCACCCGGCC GGTGTTCGTG CCGACCGCCG GGCCCAACGA TCCGATCGCG
CGCTACCGCA CCGCGGTGAT GGCCCGGCTG CGGCTGGTCG GCATCGGGGT TCCGGTCGCC
GTCGGCCTGC TGGCCGGCCT GATCGCCCAG AACTACTGGC AGCGTGTGCA GCTGTTCCTG
CACGGCGGCA GCTTCGGGGT GTCCGACCCG CAGTTCGGCA TCGACCTCGG CTTCTACGCG
TTCGACCTGC CGTTCTACCG CTTGATGCTG ACGTACCTGT TCGCCGCGAC GTTCCTGGCG
TTCATCGCGA ATCTGCTGGG TCACTACCTG TTCGGGGGCA TCCGGCTGGC CGGGCGCAGC
GGCGCGCTGA GCCGGGCGGC CCGCATCCAG CTGATCGCTC TGGTCGGGTT CCTGATGCTG
CTGAAGGCGG TCGCCTACTG GCTAGACCGC TACGAGTTGC TCAGCCATAC CCGCGGCGGC
AAGCCGTTCA CCGGAGCCGG GTACACCGAC ATCAACGCGG TGCTGCCGGC CAAGCTGATC
CTGATGGTCA TCGCGGTGAT CTGCGCCGCG GCGGTGTTCT CCGCGATCGT GCTGCGCGAC
TTGCGGATTC CCGCGATCGG TGTGGTGCTG CTGCTGCTGT CCTCGCTGAT CGTGGGTGCG
GGCTGGCCGC TGGTGGTGGA ACAGATCAGC GTGCGCCCCA ACGCCGCGCA GAAGGAAAGC
GAATACATCA GCCGAAGTAT CACCGCCACC AGACAGGCCT ACGGGCTGAC CGACGAGGCG
GTGGAGTACC GCGACTACCC CGGTAACGCC ACGGCGACGG CGCAGCAGGT GGCCGCCGAC
CGCGCCACGA CGTCCAACAT CCGGGTGCTC GACCCGAACA TCGTCAGCCC GGCGTTCACC
CAGTTCCAGC AGGGTAAGAA CTTCTACTTC TTCCCCGACC AGCTGAACAT GGACCGCTAC
CGCGACGAGG ACGGCAATCT GCGTGATTAC GTGGTGGCCG CCCGCGAGCT CAACCCGGAC
CGACTGATCG ACAACCAGCG TGACTGGATC AACCGGCACT CGGTGTACAC CCACGGCAAC
GGCTTCATCG CCTCGCCGGC CAACACCGTG CGCGGAATCG CCAACGACCC CAACCAGAAC
GGCGGTTACC CGGAGTTTCT GGCCAGCGTC GTGGGCGCCA ACGGTGAGGT CGTCTCGCCC
GGGCCGGCCC CGCTGGATCA GCCGCGCATC TACTTCGGCC CGGTGATCGC CAACACCCCC
GCCGACTACG CGATCGTCGG CGAGAGCGGC ACCCCGCGCG AGTACGACTA CGAGACCAAC
ACCGCCACCC GCAACTACAC CTACACCGGC AGCGGCGGCG TGCCGATCGG CAACTGGCTG
ACCCGCAGCG TGTTCGCCGC CAAGTACGCC GAGCGGAACT TCCTGTTCTC GAACGTCATC
GGCGAGAACA GCAAGATCCT GTTCAACCGT GACCCTGCCG ACCGGGTGGA GGCGGTCGCG
CCGTGGCTGA CCACCGACAC CGCGGTCTAC CCTGCGATCG TCAACAAGCG CATCGTCTGG
ATCGTCGACG GGTACACCAC GCTGGACAAC TACCCGTACT CGGAGTTGAT GTCGTTGTCG
TCGGCCACCA CCGACTCCAA CGAGGTGGCG CTGAACCGGC TGCAGCCCGA CAAGCAGGTG
TCCTACATCC GCAACTCGGT CAAGGCCACC GTCGACGCCT ACGACGGCAC CGTGACGCTG
TACGCCCAGG ACGAGCAGGA CCCGGTGCTG CAGGCGTGGA TGAAGGTGTT CCCGGACACC
GTCAAGCCCA AGGCTGACAT CACCCCCGAA CTGCAGGAGC ACCTGCGCTA TCCGGAGGAC
CTGTTCAAGG TGCAGCGCGC GCTGCTGGCC AAGTACCACG TCGACGACCC GGTGACGTTC
TTCTCGACGT CGGACTTCTG GGATGTCCCG CTCGACCCGA ACCCGACGGC CAGCAGCTAC
CAGCCGCCGT ACTACATCGT CGCCAAAGAC CTTGCCGAGA ACAACAATTC GTCGTCGTTC
CAGCTGACCA GTGCGATGAA CCGGTTCCGG CGCGACTTCC TGGCCGCCTA CATCAGCGCC
AGCTCGGATC CCGAGACGTA CGGCAAGCTC ACCGTGCTGA CCATTCCCGG TCAGGTCAAC
GGGCCCAAGC TGGCGTTCAA CGCGATCAGC ACCGACACCG CCGTCAGCCA GGACCTCGGT
GTCATCGGCC GTGACAACCA GAACCGGATC CGCTGGGGCA ATCTGCTGAC GCTGCCGATG
GGGCAGGGCG GATTGCTTTA TGTCGCACCG GTTTACGCCT CACCGGGCGC CAGCGACGCG
GCATCGTCGT ATCCGCGTCT GATCCGCGTC GCGATGATGT ACAACGACCA GATCGGTTAC
GGGCCGACCG TGCGCGACGC GCTGACCGAC CTGTTCGGCC CCGGCGCGGA TGCCACCGCG
ACAGGACCTG CGGCGACGGA ACCGCCCGCC GGTCAGGCGC CGCAACCGCA GGGGAACAAC
CAGCCGCCTG CCGCGGCACC GCCGAACCGG CCGGGACAGG CCCCGACGCC GCAACAGCCG
GAGGTGCCGG TGGCGGTGCC GCCGACCGGG CCGACCCAGC TGTCCGCCGG GAAAGCTGCT
GCGCTGCAGG ACGTCAACGC GGCACTGGAC GCGCTGCAGG ACGCGCAACG CAGCGGTGAT
TTCGCGCAGT ACGGTGAGGC GCTGCAACGC CTCGACGACG CGGTGAACAA GTACCAGGCG
ACGAACTAG
 
Protein sequence
MRPAARMPNL TRRSRVMIAV ALAVVVLLLL GPRLVDTYVN WLWFGELGYR SVFTTQIVTR 
LLLFLAVAVV FGAVVFAAMA LAYRTRPVFV PTAGPNDPIA RYRTAVMARL RLVGIGVPVA
VGLLAGLIAQ NYWQRVQLFL HGGSFGVSDP QFGIDLGFYA FDLPFYRLML TYLFAATFLA
FIANLLGHYL FGGIRLAGRS GALSRAARIQ LIALVGFLML LKAVAYWLDR YELLSHTRGG
KPFTGAGYTD INAVLPAKLI LMVIAVICAA AVFSAIVLRD LRIPAIGVVL LLLSSLIVGA
GWPLVVEQIS VRPNAAQKES EYISRSITAT RQAYGLTDEA VEYRDYPGNA TATAQQVAAD
RATTSNIRVL DPNIVSPAFT QFQQGKNFYF FPDQLNMDRY RDEDGNLRDY VVAARELNPD
RLIDNQRDWI NRHSVYTHGN GFIASPANTV RGIANDPNQN GGYPEFLASV VGANGEVVSP
GPAPLDQPRI YFGPVIANTP ADYAIVGESG TPREYDYETN TATRNYTYTG SGGVPIGNWL
TRSVFAAKYA ERNFLFSNVI GENSKILFNR DPADRVEAVA PWLTTDTAVY PAIVNKRIVW
IVDGYTTLDN YPYSELMSLS SATTDSNEVA LNRLQPDKQV SYIRNSVKAT VDAYDGTVTL
YAQDEQDPVL QAWMKVFPDT VKPKADITPE LQEHLRYPED LFKVQRALLA KYHVDDPVTF
FSTSDFWDVP LDPNPTASSY QPPYYIVAKD LAENNNSSSF QLTSAMNRFR RDFLAAYISA
SSDPETYGKL TVLTIPGQVN GPKLAFNAIS TDTAVSQDLG VIGRDNQNRI RWGNLLTLPM
GQGGLLYVAP VYASPGASDA ASSYPRLIRV AMMYNDQIGY GPTVRDALTD LFGPGADATA
TGPAATEPPA GQAPQPQGNN QPPAAAPPNR PGQAPTPQQP EVPVAVPPTG PTQLSAGKAA
ALQDVNAALD ALQDAQRSGD FAQYGEALQR LDDAVNKYQA TN