Gene Mvan_4373 details


Gene Information

Locus tag: Mvan_4373
Symbol:
ID: 4649393
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4691515
End bp: 4693992
Gene Length: 2478 bp
Protein Length: 825 aa
Translation table: 11
GC content: 70%
IMG OID: 639807844
Product: hypothetical protein
Protein accession: YP_955155
Protein GI: 120405326
COG category:
COG ID:
TIGRFAM ID:
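
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, not part of the original record: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4691515, 4693992)
print(length_bp)                           # 2478
print(expected_protein_length(length_bp))  # 825
```

Both values match the Gene Length and Protein Length fields listed in the record.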


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGCGG CGCCGCTGCA GGTTCGCCTG CAACGGGCCG ACGACGTCGA GATGATCGGG 
AAGCTGCCGG GCTCTGGATA TCGGGTGCCG CCGGCGCTGG TGCGCCGCGC CGACGGTCAG
ACGGTGCAAC TGACGCCGCT CCTGTACGCG ATCCTGCAGG CCGTCGACGG TGACAAGACC
GCCGCGGAGG TGGCCGCGGC GGTCAGCGAA TCCACCGGGC GCTCGGTCAG CGAAGCCAAC
GTCGACCAGT TGGTCGAGGA GCAACTGCGT CCACTCGGGC TGATGACACT CCCCGACGGT
AGCCAGCCGG CCACCAGGAA GCGAAACCCG TTGCTGGGGC TGCGCTTTCG CTACGCGGTC
ACCGATGCCG ACCGCACCCG CAGGCTGACC GACCCGTTCC GGGTGCTGTT CCGGCCGTGG
GTGGCAGTGC CCGCACTGGC GGCGTTCGCC GTCGTGTGCT GGTGGGTGTT TTTCGAGAAG
GGTCTTGCGT CGGCCGCGCA CGACGCCTTC GAGCGGCCCG GGCTGCTGAT CCTGGTTTTC
GTCGTCACCA TCGTCTCGGC GGGATTCCAC GAGTTCGGTC ACGCGGCGGC CGCGCGGTAC
GGCGGCGCCA CACCAGGGGT GATGGGTTTC GGCGTGTACC TGGTCTGGCC GGCGTTCTAC
ACCGACGTCA CCGATTCCTA CCGGCTCGGA CGAGGTGGGC GGCTGCGAAC CGACCTGGGC
GGTCTCTACT TCAATGCGCT GGTCGCGATC GTGATCACCG GCCTGTGGTT GTGGCTGCGC
TACGACGCGC TGCTGCTCGT CGTCGCCACG CAGATCCTGC AGATGCTGCG CCAACTGGCC
CCGCTGGTGC GGTTCGACGG CTACCACGTG CTGGCCGATC TGACCGGTGT GCCGGATCTC
TACTCGCGGA TCAAGCCGAC ACTGCTCGGC GTACTGCCGT GGCGGTGGGG CGACCCGCAG
GCCCGACAGC TCAAGTGGTG GGCGCGCGCC GTGGTGACGC TGTGGGTGAT CCTCGTCGTG
CCGCTGCTGC TCGCGACCGT CGCCATCGCC GTGTGGGCGC TGCCCCGGGT GCTGGGTTCG
GCATGGGCGA GCCTGAGGAC TCAGCGGGAG GTCTTCGTCA CCGCCTGGGC CGACGGCGAT
GTCGTTCAGG CGGTCGCGCG CGTGCTGGCG ATGATCGCGA TCGTCATCCC GGTGGCCGGT
GTGCTCTACA TGCTGGGGCG GCTCGCCCGT CGCACCGCTG CCGGCTCCTG GAAGGCCACG
GCGGGCAAGC CGCTGATGCG GACGATGGCG ATGATGGCCG GTGGTGTCGT CCTGTGCGGG
GTGGCGTACG CATGGTGGCC GCAGGAGGGG CGGTATCAAC CCATCCAGCC GTGGGAGCGC
GGCACCCTCG GCGACATCGT CTATGCGTTG AAGATCGATC GGATGAATCG GCCGGCGCAG
CCCGACACCG TGGCCGCGCC GCGTCGGTTG GTCAACGGGC AACAGGGCGT GATGCAGGCG
GTTTGGGACG CCCGGATGCC GACGCCCACA CAGCAGTCGC CGCAGCTCGC GCTCGTGCTG
ACGCCGCGGA CCACGGTGGC GGAAACGCCA TTTCGCGGGG GCAGTGGTGG GGGAGCGGCC
GCCGCGGCGC CCGATCAGGT GGCCGACGGT TGGGTGTTCC CCGTCGACAA GCCGCTCGCG
CCCGAACCGG GGGACACCCA GGCGCTGGCC GTGAACACGA CCGACGGCAC CACCGTGTAC
GAGGCCGCGT TCGCAATGGT GTGGATCGCC GACAACTCCG ACGCGATGAA CGTCAACGAG
GCGCAGGCCT ACGCGTCGTG CGAGTCGTGC GCCGCGGTCG CGGTGGCCTA CCAGGTGATC
TTCGTCGTCG ACACCGACGA CGCCGACGAC AACGTGGTGG CTCCCCAAAA CCTCGCGGGC
GCACTCAACT ACAACTGCGT CAACTGCCTG ACCTATGCAC TGGCCCGCCA GATCTTCGTC
ACTCTCGACG ACCCGCTGTC GCAACAGGCG ATGGACGAAC TCGACGCGCT CTGGGGCAAC
ATCGCGGCCT TCGCCGAACA GATCGAGGCG GGCACGGTAC CGCCGGAGGA CATCGAAGCG
CGGCTGGCGG TGTACACCGA GCAGCTGATG GGCGTCCTCC AGACCGCTGC ACCGACGGCG
ATACCGCAGC AGACCGCGAC CTCGACCGCG CCGACCACTT CCAACGGCGC GCCCACTCCA
ACGGCCGATG CCCCGGCGAC GACCAGCGCC CCGTCGCCGA CGGCCCCGGA GACGCCCGTT
ACGGAGACGA CGGCCCCGGA GACAATCCAG GCGGTGGCGC CGGCGACCAC CGAAACCGCC
GCCAGCAGCG AGGTCGCCGA ACCGACGGCC GGCCCCACCA CTTCGGTGTC GTCGCCAGAC
CCGTCGGCGA CGAGCGAACC GGAGGCGAGC GGTGCATCCA CAGGTGCCGA CACGGCGGAC
GCCGACACGA CAGGGTGA
 
Protein sequence
MDAAPLQVRL QRADDVEMIG KLPGSGYRVP PALVRRADGQ TVQLTPLLYA ILQAVDGDKT 
AAEVAAAVSE STGRSVSEAN VDQLVEEQLR PLGLMTLPDG SQPATRKRNP LLGLRFRYAV
TDADRTRRLT DPFRVLFRPW VAVPALAAFA VVCWWVFFEK GLASAAHDAF ERPGLLILVF
VVTIVSAGFH EFGHAAAARY GGATPGVMGF GVYLVWPAFY TDVTDSYRLG RGGRLRTDLG
GLYFNALVAI VITGLWLWLR YDALLLVVAT QILQMLRQLA PLVRFDGYHV LADLTGVPDL
YSRIKPTLLG VLPWRWGDPQ ARQLKWWARA VVTLWVILVV PLLLATVAIA VWALPRVLGS
AWASLRTQRE VFVTAWADGD VVQAVARVLA MIAIVIPVAG VLYMLGRLAR RTAAGSWKAT
AGKPLMRTMA MMAGGVVLCG VAYAWWPQEG RYQPIQPWER GTLGDIVYAL KIDRMNRPAQ
PDTVAAPRRL VNGQQGVMQA VWDARMPTPT QQSPQLALVL TPRTTVAETP FRGGSGGGAA
AAAPDQVADG WVFPVDKPLA PEPGDTQALA VNTTDGTTVY EAAFAMVWIA DNSDAMNVNE
AQAYASCESC AAVAVAYQVI FVVDTDDADD NVVAPQNLAG ALNYNCVNCL TYALARQIFV
TLDDPLSQQA MDELDALWGN IAAFAEQIEA GTVPPEDIEA RLAVYTEQLM GVLQTAAPTA
IPQQTATSTA PTTSNGAPTP TADAPATTSA PSPTAPETPV TETTAPETIQ AVAPATTETA
ASSEVAEPTA GPTTSVSSPD PSATSEPEAS GASTGADTAD ADTTG
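
The sequence blocks above are wrapped in groups of 10 characters, so they need the whitespace removed before any computation. The following is a minimal Python sketch of that cleanup, plus a GC-content calculation and a translation. The helper names are hypothetical. The codon table is written out by hand and relies on translation table 11 sharing the standard code's codon-to-amino-acid assignments; the sketch also assumes the reading frame starts at the first base and does not handle alternative start codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a wrapped sequence block."""
    return "".join(block.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGGACGCGG CGCCGCTGCA GGTTCGCCTG CAACGGGCCG ACGACGTCGA GATGATCGGG"
print(translate(clean_sequence(first_line)))  # MDAAPLQVRLQRADDVEMIG
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence block through `clean_sequence` and `gc_content` should give a value comparable to the GC content field, which is reported as a rounded percentage.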