Gene Mvan_0041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_0041 
Symbol 
ID4644895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp52968 
End bp54452 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content67% 
IMG OID639803552 
Producthypothetical protein 
Protein accessionYP_950898 
Protein GI120401069 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.532509 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAGAC ATTCGAAGGT TTCCGGAATC GGTTGGGCAG CATGGGTTAC GAGTGGAATC 
GCCGTGCACG TGACGGTTGT CGTCGCCTGC ACGTATGTGA GCGGCGGCCC GTTGGCCGCC
ACCGACTTGG CAGCGGCCAC CAGCATCTTC GTCGATGGTT CTAAACCGAT CCTGGGAGGA
CGTGAGGACG GGGTGCCGTT CTACCGGATG GCCGACTCGT TCAAGGGCGG CTATAGCGGC
CCGGAGTACA TCGAGAACTT CGTCATCTAT CCGCGCAGCC TCGGCGCGCT CACCGGCTGG
GGCGATCCGA CCTACGACGA CTCCGAGGAC GAAGGCACCG TCAACACGAT CGACGCGGTC
CGGAATGCCA AGAAAGACAC CGCGTTTCGG GCCGGTGATC CGATTCGCAT CGTCGGATAC
TCGCAAGGAG CAGGAGCAGC CTCGGCCGCG ATTCCCGAGC TGGAAGGCGG CGAATTCGCT
GACGACAACA TTCAGTACGT CCTGGCAAGC AACCCGTCCC GCAACGACGG CGGCATCCTG
ACACGGTTTC CGAAAGGGAC CTACCTGCCG ATCATCGGGG TGTCCTTCGG TGACGGGGTG
AGCACGACCG ACCCGGATAC CGACGTCGTG CAGGTGACCA AGCAATACGA CGGCGTGGCA
GACGCGCCGG ACTACGTGCT GAACGTCGTC GCCGACGTGA ACGCCGTTCT TGGCTTCGCC
TACCTGCACT CCGGCTACTA CAAAGACGTC GAGCGCGTCG ACCCCGAGAC CCTCGACCCC
GACGACCCGC CGGCCGGAAT GCTCGTCTCC ACCAACGCCG CCGACACCGT CACCGATGTG
GTGCTCGAGG CGCCCGAGGG CGAGCTGCCG TTGACCATGC CGCTGCGCCA ACTCGGTGTG
TCTGACGACG TGGTTGTTGC ACTCGACCCG TTCCTGCGCT CGGTGATCGA GACCGGCTAC
GACCGGCCGG TCGGGTCGGG CGAGTATCCC GACGAACCGG AACCGTTCAG GTTGGTGCCA
CCGGCCGATC AATGGGAGTC GGACGCCGCA TCTGTCGCCG AGGGGCTAGC GGAGACCAAG
CGCCGCCTCG CCGCGCTGCG CGACACCGAC ACGGGCGCAG GCCAAGACGC GAGCGATGAC
GACGCCGGGC CACTCGCGAC GGGATCAGAG GACGCCGATC GGCACGACCG CCGCACGCCG
TCATCGGAAG TCGACACGAC GGACACCATC CACGGCCCCT CATGGTCGGA CGAAGACGCG
ACCGACGACA CCGACAGCGG CAATCCCCAG AAGGTCGGCC CATCCTCCGA TCACGTTGGC
TGGAACCAGA TATCGCGGGC ATCCGCCCTA TCGCCGGGAG CGGACGCTGC CGCCGGCACG
CCGCCAGCCG AGTCCACCGC GCACCCGGAT GACGATGCCC GCCGATCGCC CGAGTCGGAG
CCGCTGCAGC GCAGCGAGAA GACCCGCCGT CGCGGGGCAT CGTAG
 
Protein sequence
MGRHSKVSGI GWAAWVTSGI AVHVTVVVAC TYVSGGPLAA TDLAAATSIF VDGSKPILGG 
REDGVPFYRM ADSFKGGYSG PEYIENFVIY PRSLGALTGW GDPTYDDSED EGTVNTIDAV
RNAKKDTAFR AGDPIRIVGY SQGAGAASAA IPELEGGEFA DDNIQYVLAS NPSRNDGGIL
TRFPKGTYLP IIGVSFGDGV STTDPDTDVV QVTKQYDGVA DAPDYVLNVV ADVNAVLGFA
YLHSGYYKDV ERVDPETLDP DDPPAGMLVS TNAADTVTDV VLEAPEGELP LTMPLRQLGV
SDDVVVALDP FLRSVIETGY DRPVGSGEYP DEPEPFRLVP PADQWESDAA SVAEGLAETK
RRLAALRDTD TGAGQDASDD DAGPLATGSE DADRHDRRTP SSEVDTTDTI HGPSWSDEDA
TDDTDSGNPQ KVGPSSDHVG WNQISRASAL SPGADAAAGT PPAESTAHPD DDARRSPESE
PLQRSEKTRR RGAS