Gene Mvan_1131 details

Gene Information

Locus tag: Mvan_1131
Symbol:
ID: 4648566
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 1200869
End bp: 1202338
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 67%
IMG OID: 639804630
Product: hypothetical protein
Protein accession: YP_951973
Protein GI: 120402144
COG category:
COG ID:
TIGRFAM ID:
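
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of residues for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1200869, 1202338)
print(length)                           # 1470
print(expected_protein_length(length))  # 489
```

Both values agree with the Gene Length and Protein Length fields of this record.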


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCGCCGC CCCTCCCTAG TCTCAAGGTC GTGGACTACG CCGCCCTCAC CACGCTGCGC 
GAGCGACACC CCGCCTGGCG GCTGTTGCGC GCGGGCAATG CCGCGCTGGT CCTGTCCTTC
CTCGGCCAGT TCTTCGTCGA AGACAACCGC GGCGCATGCT CGGCGAGTGT GCTCGCCGAC
GCCTTGGACG ACCACCTCTA CGCCCTGAAC GCCGACGAGG TGCGGTACCC GAAGCAGCCC
CGCGCGTATC TGGAGGACTG GGCCGCGACC GACGCCGGAT ACCTGAGGCG GTTCTACCCA
CCCGGCGACG ATGAGGTCCA CTACGAAGCC ACCCCGGCAT TCGAGAAGGC CTACGCCTGG
GTGCACAGCC TGCAGGGCAG GTCCTTCGTG GGCACCGAAT CCCGGCTGCA CACGGTGATC
GCACTGCTGC GCCAGATTGT GCAGGGCACC GAGACGGAGC CCGAAGTCCG GCTGGCCGAT
CTGCGCCGTC GCCGCGATGA GCTCGACGCC GAGATCGCGG CGGTGGAAGC AGGGCACGTT
GCCATCTTGG ACGCCACCGG AGTCCGCGAC CGCTACCAAC AGTTGGCGAC CACCGCGCGC
GAGTTGTTGT CTGATTTCCG TGAGGTCGAG GAGAACTTCC GCCTGCTGGA CCGGGCCGCT
CGGGAAAAGA TCGCCGCCTG GGATGGATCC AAAGGGGAAC TGCTGGCCGA ACTGATCGGC
AGTCGCTCCG AGATTGCCGG GTCTGACCAG GGGCGTAGCT TTCAGGCGTT TTTCGACTTC
CTGCTCTCCG AGCAGCGGCA AACCGAGCTT GCCGACCTGC TCGCCAAGGT GACCGCCCTG
GACAGCGTCG ACGCCGACGA GCGGATCAGG GGTATCCACC ACGACTGGTC CGAGGCTGCC
GACCGCGCCC AGCGCACCGT TCGCCAGATC TCCGAACAGC TTCGCCGGTT CCTCGACGAC
CAGGTGTGGC TGGAGAATCG GCGGGTGTTG GATCTGGTGA GGGCCGTCGA GGCCGCCGCC
CTCGACGTCC GCGACACCCC GCCGTCGTTC GGGCTGGAGG TCGATGAACA GGGCATCGAC
ATCGCCCTGC CGTTCGAGCG CCCGCTCTAT CAAACGCCGG TCGCGGTGGC AGTCGAGAGC
CGTATCGCCG ACGGCACCGA AAATGTCGAC GCCGACCTGC TGTTCACCCA GACCTATGTC
GACCAGGTCC GGCTGGCCGA GACCATCCGC GCCGTCCTGC CGGAGAACTC GTCGGCGCTG
CTGTCCGACG TCATCGCGGT GCATCCCATC GAGCAGGGCG CCGCGGAGAT CGTCGGCTAT
CTGGCCCTCA ATGACGACGA CGTCAGCGTC GACATGGACG ACACCGATGA GACCGTGCTG
GACTATCCCG ACCCGGCCCA TCCCGACGTC ACCAAGCGCG CCCGGCTACC GAAGGTGACG
GTGCGCCGGC ACGGGGGCCA ACAACCATGA
 
Protein sequence
MSPPLPSLKV VDYAALTTLR ERHPAWRLLR AGNAALVLSF LGQFFVEDNR GACSASVLAD 
ALDDHLYALN ADEVRYPKQP RAYLEDWAAT DAGYLRRFYP PGDDEVHYEA TPAFEKAYAW
VHSLQGRSFV GTESRLHTVI ALLRQIVQGT ETEPEVRLAD LRRRRDELDA EIAAVEAGHV
AILDATGVRD RYQQLATTAR ELLSDFREVE ENFRLLDRAA REKIAAWDGS KGELLAELIG
SRSEIAGSDQ GRSFQAFFDF LLSEQRQTEL ADLLAKVTAL DSVDADERIR GIHHDWSEAA
DRAQRTVRQI SEQLRRFLDD QVWLENRRVL DLVRAVEAAA LDVRDTPPSF GLEVDEQGID
IALPFERPLY QTPVAVAVES RIADGTENVD ADLLFTQTYV DQVRLAETIR AVLPENSSAL
LSDVIAVHPI EQGAAEIVGY LALNDDDVSV DMDDTDETVL DYPDPAHPDV TKRARLPKVT
VRRHGGQQP
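
The protein sequence can be reproduced from the gene sequence with translation table 11, and the GC content field can be recomputed the same way. The code below is a minimal sketch, not the database's own pipeline. It assumes the sequence is pasted in the 10-base blocks shown above, that the first codon is an initiator, and that table 11 uses the standard amino acid assignments with TTG, CTG, ATT, ATC, ATA, ATG and GTG as possible start codons. The names `clean`, `gc_content` and `translate_cds` are hypothetical.

```python
# Standard codon assignments, listed in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed initiator codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon.

    An initiator codon in first position is read as M, as in the record above.
    Codons containing anything other than A, C, G or T raise KeyError.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGTCGCCGC CCCTCCCTAG TCTCAAGGTC"))  # MSPPLPSLKV
```

Note that the gene starts with TTG rather than ATG, so a translation that does not treat the first codon as an initiator would begin with L instead of M.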