Gene Mvan_4867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4867 
Symbol 
ID4643845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5210746 
End bp5211876 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content70% 
IMG OID639808338 
Producthypothetical protein 
Protein accessionYP_955646 
Protein GI120405817 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.354636 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.601837 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCTCG TCGTCGGCGG CGTCGTGCCC GTCGAGGACG TCGTCGGCAG GGTGCGGGAA 
TCCAACGAGG TCCTCGCCTC GCTGCCGAAC TCCGGCGCGG TCCTGGTCGG CGACCGCAGG
CACGGCAAGA CGTCCCTGGC CCGGCTGGTG CAGCGGATGG CGGCCGACAG CGGCGCCGTC
GTCGTCTCGG TCAGCGCCGA ACGCGAGACC TACGCCGAGT TCGTCGCCGC GTTGATCTCC
GAACTGGCCC GCCTCGACCC GGCATGGGCC CAGGAACTCG CCCGGATCCG CCTCACCCTG
ACCGCGGGAC CCGTCCGGCT GGAACGGGAC AGCCGCGCCG CGGCGACCCT CGACGACCTG
CTCGACCGTG CCGTCCGGCG GGCAGGCAGC CGCATCCTGG CGCTGTTCAT CGACGAGGTC
AGTGTGTTGG CACGCAATCT GGAGCGCGCC TCGCCCGGGT CCGGGGACAC CTTCCTGCAT
CTGCTGCGCC GGGTCCGCCA GGAGAATCCG GGCCGGGTGG CCACGGTGCT CTCGGGGTCC
ATCGGCTTTC ACCACGTGTC GGCCGACGCG CCCAGCACCG TCAACGACAT CCCGAAGATC
GCCGTCGGGC CCATTCGCTC CGACCATGCG ACCTACCTCG CCGAGTGCCT GCTGCTGGGC
AGCGGCGTCC CGACCACCGA CCGGCACGAG GTCGCCGCGG CGATCGCGGC GGCCGCCGAG
AACGTGCCCT ACTACATCCA GCACCTGGTG GCCGCGGCCC GCAAGTCGTG GCAGGACACC
CAGGTGGCCC CGTATCCGGA GCTGATCGAC CTGCTGGTAC TGGATGCCAT CGAAAGCCCT
TACGACCCTT GGGATCTGCG ACACTACCGG GATCGGTTGC CGCACTACTA CGGCGCCGAC
GCGCCCGCGA TCGCCGGGTT GCTCGACATC TACGCCCACG CCGCAGGGCA GGTGGGCGTC
GACACCGTGC TGATGCGTCT GCGCAGCGAG GGCAGCCCGA TCGGTGACCG GGCACAGCTG
GTGTCTTTCA TCGAAAGGCT CACGCTGGAC CACTATCTGG TGCGCACCGG CGACACCGAT
GGGTTCGCCT CTCCGCTGTT GCAGCGGGCA TGGAAGGCCA TGCGGCGGTG A
 
Protein sequence
MSLVVGGVVP VEDVVGRVRE SNEVLASLPN SGAVLVGDRR HGKTSLARLV QRMAADSGAV 
VVSVSAERET YAEFVAALIS ELARLDPAWA QELARIRLTL TAGPVRLERD SRAAATLDDL
LDRAVRRAGS RILALFIDEV SVLARNLERA SPGSGDTFLH LLRRVRQENP GRVATVLSGS
IGFHHVSADA PSTVNDIPKI AVGPIRSDHA TYLAECLLLG SGVPTTDRHE VAAAIAAAAE
NVPYYIQHLV AAARKSWQDT QVAPYPELID LLVLDAIESP YDPWDLRHYR DRLPHYYGAD
APAIAGLLDI YAHAAGQVGV DTVLMRLRSE GSPIGDRAQL VSFIERLTLD HYLVRTGDTD
GFASPLLQRA WKAMRR