Gene Mvan_3500 details


Gene Information

Locus tag: Mvan_3500
Symbol:
ID: 4649316
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 3718552
End bp: 3720093
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 67%
IMG OID: 639806977
Product: hypothetical protein
Protein accession: YP_954301
Protein GI: 120404472
COG category: [S] Function unknown
COG ID: [COG3333] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
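
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3718552
END_BP = 3720093
GENE_LENGTH_BP = 1542
PROTEIN_LENGTH_AA = 513


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3720093 - 3718552 + 1 gives 1542 bp, and 1542 / 3 - 1 gives 513 aa, matching the listed values.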


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0583954
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.298668
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGAACT TCGACTGGCT CCTGCAGGGG TTCGCCGAGG CGGCAACTCC GACGAACCTG 
CTCTACGCCG TGATCGGCGT CCTGCTCGGC ACCGCCGTCG GCGTGCTGCC CGGCATCGGT
CCCGCGATGA CCGTCGCGCT GCTGCTGCCG ATCACCTACA ACGTCAGCCC CAGCGCGGCA
TTCATCATGT TCGCCGGCAT CTTCTACGGC GGCATGTACG GCGGCTCGAC CACCTCGATC
CTGCTCAACA CTCCCGGCGA GTCGTCGTCG GTGATCACCG CCATCGAGGG CAACAAGATG
GCGAAAGCGG GTCGCGCCGC CCAGGCGCTG GCCACGGCCG CGATCGGCTC CTTCGTGGCA
GGTTCGATCG GCACCGCGCT GCTGGCGGCG TTCGCGCCCA TGATCTCGCG GTTCGCCGTG
ACTCTCGGTG CTCCTTCGTA TCTGGCGATC ATGCTGTTCG CGCTCGTCGC GGTGACGGCG
GTTCTGGGTT CGTCGAAGAT GCGCGGGCTG ATCTCGCTGC TGCTCGGTCT CGCGATCGGT
GTGGTGGGTA TCGACTCGCT CACCGGTCAG CCCCGCGCCA CTTTCGGAAT CCCGCTGCTG
TCTGACGGCA TCGACATCGT GGTGATCGCG GTCGCCGTGT TCGCAGTGGG GGAGGCGCTG
TGGGTGGCCG CCCATCTGCG GCGCCGACCG GTCGACGTGA TCCCGGTCGG CCGCCCCTGG
ATGAGCAAGC AGGACTGGGG CCGGTCGTGG AAACCGTGGT TGCGCGGCAC CGCGTTCGGC
TTTCCGTTCG GTGCGCTGCC CGCCGGCGGC GCCGAGCTGC CGACGTTCCT GAGCTACATC
ACCGAGAAGA AGCTGTCCAA GCATCCCGAG GAGTTCGGCA AGGGCGCCAT CGAAGGTGTG
GCCGGACCGG AGGCGGCCAA CAACGCCTCC GCGGCAGGCA CCCTGGTGCC GATGCTGTCG
CTGGGCCTAC CGACCAACGC GACGGCGGCG GTGATCCTGA CCGCGTTCGT CTCGTACGGC
ATCCAGCCCG GACCCACGCT GTTCGACAAG GAGCCGCTGC TGATCTGGAC GTTGATCGCG
AGCCTGTTCA TCGGCAACTT CCTGCTGCTG GTGCTGAACC TGCCCCTGGC GCCGTTGTGG
GCGAGGTTGC TGCGCACGCC GCGGCCGTAC CTGTACGCCG GGATTCTGTT CTTCGCCACC
CTGGGTGCGT TCGCGGTCAA CCTGCAGCCG CTGGATCTGG TGCTGCTGCT GATATTCGGC
TTGATGGGTC TGATGATGCG CCGCTTCGGT CTCCCGGTGC TGCCATTGAT CATCGGTGTC
ATCCTCGGCC CGCGCATCGA ACGTCAACTG CGGCAGAGCC TTCAGCTCGG CGGCGGGGAG
TGGGGCAGCC TGTTCACCGA ACCCGTCGCG ATCATCACGT ATGTGCTGAT GATCCTGCTG
CTGGCCGCGC CGTTGGTGCT GCGGTTGATG CACCGCAGCG AGGAGACGTT GCTTGTGGTC
GAGGACGACC GGGACCAGAA AGAGAAGGCT GGGAAAGTGT GA
 
Protein sequence
MENFDWLLQG FAEAATPTNL LYAVIGVLLG TAVGVLPGIG PAMTVALLLP ITYNVSPSAA 
FIMFAGIFYG GMYGGSTTSI LLNTPGESSS VITAIEGNKM AKAGRAAQAL ATAAIGSFVA
GSIGTALLAA FAPMISRFAV TLGAPSYLAI MLFALVAVTA VLGSSKMRGL ISLLLGLAIG
VVGIDSLTGQ PRATFGIPLL SDGIDIVVIA VAVFAVGEAL WVAAHLRRRP VDVIPVGRPW
MSKQDWGRSW KPWLRGTAFG FPFGALPAGG AELPTFLSYI TEKKLSKHPE EFGKGAIEGV
AGPEAANNAS AAGTLVPMLS LGLPTNATAA VILTAFVSYG IQPGPTLFDK EPLLIWTLIA
SLFIGNFLLL VLNLPLAPLW ARLLRTPRPY LYAGILFFAT LGAFAVNLQP LDLVLLLIFG
LMGLMMRRFG LPVLPLIIGV ILGPRIERQL RQSLQLGGGE WGSLFTEPVA IITYVLMILL
LAAPLVLRLM HRSEETLLVV EDDRDQKEKA GKV
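
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch, using only the Python standard library, of how the blocks could be cleaned and how a GC fraction (as in the "GC content" field) could be computed from the gene sequence. The helper names are hypothetical, and only the first line of the gene sequence is included to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace; uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
FIRST_LINE = (
    "ATGGAGAACT TCGACTGGCT CCTGCAGGGG "
    "TTCGCCGAGG CGGCAACTCC GACGAACCTG"
)

if __name__ == "__main__":
    print(len(clean_sequence(FIRST_LINE)))    # number of bases on that line
    print(round(gc_fraction(FIRST_LINE), 2))  # GC fraction of that line only
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the 67% reported in the record. A single line is not expected to match the whole-gene figure.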