Gene Mvan_5054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5054 
Symbol 
ID4644791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5405923 
End bp5408178 
Gene Length2256 bp 
Protein Length751 aa 
Translation table11 
GC content71% 
IMG OID639808525 
Producthypothetical protein 
Protein accessionYP_955832 
Protein GI120406003 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTGA AGAGTCCGGG CGTGCCGCTG GGCGCCTGGT TGGCCGAATT GGACGACGCG 
CGGCTGATCC GGCTGCTGCG TCTGCGGCCT GATCTCACCC AGCCTCCACC GGGCAGCATC
GCCGCGCTGG CTGCCCGCGC CGCCGCGCGC CAGTCGGTCA AGGCCGCCAC CGACGACCTG
GACTTTCTGC ATCTGGCGGT GCTCGACGCC CTGCTCACGC TGCACGCCGA GACGACGGCG
GTCACGCTCG CCGAGCTGGC CGGGACTTTC GGTGAACGCA TCGACAAGAC AGACATGCGC
GCCGCGCTCG ACGAACTGTC GGCGCGCGCT CTGGTGTGGG GTGACGGGGC GCTGCGGGTG
GTCGCGGAAG CGGCGTCAAG TCTGCCCTGG TATCCCGGGC AGGTCAGCCT GGAAAGCCCG
ACGCAGACCG GCGCCGAGGT CGCCGCGGCA CTCGAGACCG TGGACGGCCC GGCGCGCGAC
CTGCTGGACA AGCTGCTGGA GGGGTCCCCG ATCGGCCGGA CCCGGGATGC GATACCGGGA
ACCCCACCGG ATCGGCCCGT CCAGCGGCTA CTGGCGGCGG GCCTGCTGCG CAGAATCGAC
GACGACACCG TCATCCTGCC CCGGCTCGTC GGTCAGGTGC TGCGCGGCGA GAGCCCGGGC
CCGGTCACCT TGTCCGCACC CGACCCGACG GTGACGTCGA CGACCGCCGC CGACGTCGAC
GCCGTGGCCG CCGGTGCGGC GATCGACTTC ATGCGTGAAG TCGAGGTCCT TCTCGAAACA
TTGAGTGCGG CACCCGTTCC CGAGCTTCGC AGCGGCGGTC TCGGCGTACG GGAAGTCAAG
CGGCTGACCA AGGTCACGGG CATCGACGAA CGTCGACTGG CCCTGATCCT TGAGGTTGCC
GCGGCCGCCG GCCTGATCGC GCCCGGCATG CCGGAACCAG ACCCCCTCGA CGGCGCGGGG
TCACATTGGG CGCCCACGGT GGCGTTCGAT CGGTTCGTCG AATCGCCCAC CTCCGCCAAG
TGGCATCTGC TGGTGTCGTC ATGGCTGGAG CTGCCGGGGC GCCCCAGCCT GGTCGGCAGC
CGCGGCCCCG ACAACAAACC CTATGCGGCG CTGTCGGATT CGCTGTTCTC GACGGCGGCG
CCCCTTGATC GCCGGTTACT GCTTGAGGTC CTGGCCGATC TGCCACCGGG CTGCGGGGCC
GACGCCGACA GCGCCTCAGA GGCGATGCTG TGGCGACGGC CCAGGTGGTC GGTGCGACTG
CAGACCGGCC CGATCGGCGA CATGCTCACC GAGGCTCATG CCGTCGGGGC GGTGGGCCGC
GGCGCGGTCG CGTCACCGGT CCGGCGCATG CTCGCCGGCG ACGGGGACGA CGCAGTCGTC
GCCGCCATGG AGAAGGTGCT TCCTGCGCCT ATCGACCACT TCCTGCTGCA GGCGGACCTG
ACCGTCATCG TGCCCGGCCC GCTGGAACGG GCCCTTGCCG AACAGCTGGC GGCCGTGGCC
ACAGTCGAGT CGGCCGGTGC GGCCATGGTG TACCGCATCG ACGAGGCATC GGTGCGGCGA
GCGCTGGACA CCGGCAAGAC TGCGGGCGAG ATTCACGCTC TCTTCAACCG GCATTCGAAA
ACACCTGTGC CACAAGCGCT GACCTATCTG ATCGACGATG TGGCGCGGCG GCACGGACAG
CTGCGGGTGG GCATGGCTTC GGCGTTCGTC CGGTGCGAGG ACCCCGCGCT GTTGGCACAG
GCGGTGGCCG CACCGGCCAC CGAACGCGTC GAGCTGCGCC TGTTGGCGCC GACCGTGGCG
GTGTCGCAGG CTCCGATCGC CGACGTCCTC GCTGCGTTGC GGACGGCCGG CTTCGCACCG
GCGGCAGAGG ATGCGACGGG TGCAGTCGTC GACCTGCGCA GCCGGGGCGC CCGCGTACCG
TCACCGGGCC GTCGCCGCGG GTATCGGCAC GGTCCTACGC CGACGGATCA GACGCTCGCG
GCGATCGTCG CCGTGCTACG CAAGGTCGCG TCGACGCCGT CGCCAGGCAT GCGGCTGGAC
CCCGCGGTCG CGATCTCCGA GCTTCAGCAC GCCGCACTGC ACCAGGAATC TGTGGTGATC
GGCTATGTGG ATCCGGCCGG GGTGGCGACC CAGCGGGTGG TGGCCCCGAT CAACGTCCGG
GGCGGGCAGC TCACCGCTTA CGACCCGGCG TCGGGGCGGG TGCGCGAGTT CGCGATCCAC
CGGGTGACGT CGGTGGTGTC GGCCGACTCC GGGTGA
 
Protein sequence
MSVKSPGVPL GAWLAELDDA RLIRLLRLRP DLTQPPPGSI AALAARAAAR QSVKAATDDL 
DFLHLAVLDA LLTLHAETTA VTLAELAGTF GERIDKTDMR AALDELSARA LVWGDGALRV
VAEAASSLPW YPGQVSLESP TQTGAEVAAA LETVDGPARD LLDKLLEGSP IGRTRDAIPG
TPPDRPVQRL LAAGLLRRID DDTVILPRLV GQVLRGESPG PVTLSAPDPT VTSTTAADVD
AVAAGAAIDF MREVEVLLET LSAAPVPELR SGGLGVREVK RLTKVTGIDE RRLALILEVA
AAAGLIAPGM PEPDPLDGAG SHWAPTVAFD RFVESPTSAK WHLLVSSWLE LPGRPSLVGS
RGPDNKPYAA LSDSLFSTAA PLDRRLLLEV LADLPPGCGA DADSASEAML WRRPRWSVRL
QTGPIGDMLT EAHAVGAVGR GAVASPVRRM LAGDGDDAVV AAMEKVLPAP IDHFLLQADL
TVIVPGPLER ALAEQLAAVA TVESAGAAMV YRIDEASVRR ALDTGKTAGE IHALFNRHSK
TPVPQALTYL IDDVARRHGQ LRVGMASAFV RCEDPALLAQ AVAAPATERV ELRLLAPTVA
VSQAPIADVL AALRTAGFAP AAEDATGAVV DLRSRGARVP SPGRRRGYRH GPTPTDQTLA
AIVAVLRKVA STPSPGMRLD PAVAISELQH AALHQESVVI GYVDPAGVAT QRVVAPINVR
GGQLTAYDPA SGRVREFAIH RVTSVVSADS G