Gene Mvan_5029 details


Gene Information

Locus tag: Mvan_5029
Symbol:
ID: 4644639
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 5380444
End bp: 5382480
Gene Length: 2037 bp
Protein Length: 678 aa
Translation table: 11
GC content: 64%
IMG OID: 639808500
Product: hypothetical protein
Protein accession: YP_955807
Protein GI: 120405978
COG category:
COG ID:
TIGRFAM ID:
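
The start, end, gene length, and protein length fields of this record are mutually consistent, which a little arithmetic can confirm. The sketch below assumes inclusive 1-based coordinates and that the gene length includes one stop codon while the protein length does not. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5380444, 5382480)
print(length)                  # 2037
print(protein_length(length))  # 678
```

Both values match the Gene Length and Protein Length fields of the record.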


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.120961
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTGCCG CCTCGGATAT CTTCGCGGGC CCGATCGTGC GGCGCAGTTC GCCTCCACGT 
ATCGCGGTCT GGTTGGCGAC GACCAAGCCT GTCGAACTCG ACGGACTGGT GCGCAAGGCC
GGCACCGCAG AGTGGATCGG CCAAACCACA ACGGTCGACC GGATCAAGCT GTTCGACGGG
CTGTTCGCCT ACCTGGTCCA GATCGCACCG ACCGCGGGCC ATTTCCCGAC GGGCACCCTG
CTGGAGTACT CGCTGGGCAC CGTCGGCAAG GACGGCGAGG CCGATCATTC GTGGTTCAAA
TCGGTGGTCG CCGAGGACGG CCTGGCCTAC CCGGGCTTCG AGCTTCCCAC GCTGTACCTA
CAGGCGGCGG GGGCGAAGCT GAACGTGCTC TACGGGTCGT GCCGCAAACC GCACGACATC
GACGGCGGCG ACAACGACGC CCTGGCCTAC GGCGACTCAC TGGTGCTCAA CAACGTCACG
ACGTTGGCGG CTCGGCCCAC GATCCTGTGC CTGGCGGGTG ATCAGATCTA CGCCGACGAT
GTCCACGACG CCACTTTCGA TGCGATCAAC ACGGTCGCCA CGAAGCTGGA AGGCTCCACT
CCGGAGAAGA TGCCCAACGG CGCGACGGTC CCGGGGAGGG GCCAGCGCTT GAAGTGGACC
ACGGATAAGG CCGGGTTTAC CAGTGGCGAA GCGGCCAATC ACCTCGTCAC GTTCGCCGAG
TATGTCGCCC ACTACGGGCT GGCCTGGAAC AGGCGCAACT GGCCGACCGT GTCGGTGGCC
AAGGAGGTGG TGTCCTACCG TGACGGCCTG CCGGCGGTCC GGCGGCTGAT GGCCAACACC
CCCACCTACA TGATGTTCGA CGACCACGAC GTCACCGACG ACTGGAACTT CTCGATCAAC
TGGGCCGCCC AGGTCAAGAG CTCCGCGGTC GGTACCAGGA TCATCACCAA CGCGCTGGCG
GCGTATTGGA TCTTCCAGGC CTGGGGTAAC GACCCCAAGG TCAGCGCACG CGAAACCCCG
CACATGCGTG ACGCTTTGGC TAAGCGGCTG ACCGACCCGG TCGAGCTCGA GACCGTGGTC
GGTACGAAGT CGACGCTCAA CGAGTGGGAA TTCCAGACCC CGACCATTCC GGTGCTCTAC
TTCCTCGACA CCCGAACCAA CCGCGGTTAC AAAGACGACT TCAAGCGCGC CGACGGTGGC
GAGCCCGCGT TCCTCAAGTC TGCTGCCGCG TGGCTGCCCA CCCGCGACCG CCTCAACAAG
ATCGTCGACG TGCAACAGCG CAGCGTTCCA CTCGTCCTGG TCACCGCGGC ACCGGTGTTC
GGATTCGAGA CGATCGACTC GGCACAGATG GCGGTGTCCG GAAAGATCGT CCCGACATCG
TATTTCGACC TGGAAGGCTG GGCCGCCAAC CGCGCGCATC TGTTTCTGTT CCTGACACTG
TGCCGTGATC ACGACGTGGT GGTGCTCTCG GGGGACGTGC ACTACGCGTT CACCTCGACC
GCATCGTTCG CGGTTTTCGA CGCCGCGTTC GTCCGCGGTG CGGCCGCCAG AGTTCCGGGT
ATGACGCTAC CTGGGGTATC GGGCTCCACA GCCACCAACG CTCACCTGTA CACCTCGCGG
TTCGTTCAGC TCAACTCCAG CGGCACCCGC AACTCGGTCG GCGGGTGGGG CAAGACCAAG
ATGATGTCGG GAATGGCCAA CAATCAGGGT GAGAGCGGCT ACATGTTCGC CGGCAACCCG
ATGTCGCCCG AGCCCGCGCG ATTCAAGAAC GGCAGGCTCT ACACCCCGAT CACGGTGATG
GGGGTGACGG TGTGGCGGGA ACGCGACCTG GCCGACGTCC AGCCGAACTT CGTGTACGAG
CAGCGGATCA ACGACCCGGG TAACACCCGG TTCATCAACA AGCACAACCT GGGATACGCA
CAGTTCCTGG GCCGCAAGGT CCAGAACGGC TTCCTCGTCG ACGGAAAACT GGACCCGTCG
AATTCGACTT CGTTCGACTT CGGCAGCGGG GCCGCCTGGA CACCCGCGGC ACCGTGA
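
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be compared with the fields of this record. The following is a minimal sketch of that clean-up. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence shown above, used as a small example.
first_line = "ATGCTTGCCG CCTCGGATAT CTTCGCGGGC CCGATCGTGC GGCGCAGTTC GCCTCCACGT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full sequence text to the same two functions gives values that can be checked against the Gene Length and GC content fields.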
 
Protein sequence
MLAASDIFAG PIVRRSSPPR IAVWLATTKP VELDGLVRKA GTAEWIGQTT TVDRIKLFDG 
LFAYLVQIAP TAGHFPTGTL LEYSLGTVGK DGEADHSWFK SVVAEDGLAY PGFELPTLYL
QAAGAKLNVL YGSCRKPHDI DGGDNDALAY GDSLVLNNVT TLAARPTILC LAGDQIYADD
VHDATFDAIN TVATKLEGST PEKMPNGATV PGRGQRLKWT TDKAGFTSGE AANHLVTFAE
YVAHYGLAWN RRNWPTVSVA KEVVSYRDGL PAVRRLMANT PTYMMFDDHD VTDDWNFSIN
WAAQVKSSAV GTRIITNALA AYWIFQAWGN DPKVSARETP HMRDALAKRL TDPVELETVV
GTKSTLNEWE FQTPTIPVLY FLDTRTNRGY KDDFKRADGG EPAFLKSAAA WLPTRDRLNK
IVDVQQRSVP LVLVTAAPVF GFETIDSAQM AVSGKIVPTS YFDLEGWAAN RAHLFLFLTL
CRDHDVVVLS GDVHYAFTST ASFAVFDAAF VRGAAARVPG MTLPGVSGST ATNAHLYTSR
FVQLNSSGTR NSVGGWGKTK MMSGMANNQG ESGYMFAGNP MSPEPARFKN GRLYTPITVM
GVTVWRERDL ADVQPNFVYE QRINDPGNTR FINKHNLGYA QFLGRKVQNG FLVDGKLDPS
NSTSFDFGSG AAWTPAAP