Gene Mvan_5801 details


Gene Information

Locus tag: Mvan_5801
Symbol:
ID: 4645744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 6187975
End bp: 6189126
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 69%
IMG OID: 639809277
Product: hypothetical protein
Protein accession: YP_956572
Protein GI: 120406743
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3934] Endo-beta-mannanase
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
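
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, which is consistent with the reported gene length, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 6187975
END_BP = 6189126


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1152
print(protein_length(length_bp))  # 383
```

Both results match the Gene Length (1152 bp) and Protein Length (383 aa) fields.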


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCGCCCG AGCCTCTCCT GAACCGACGC GCCGCGCTCA AGGTGCCCGC GGTGCTGGCG 
GCGGGCATGG CGTTGTCGAC CGTGCCCCGC GCGTCCGCCG AACTCACCCG GTGGTCCCCG
GATCGCGCGC ACCGGTGGCA CCGGGCGCAG GGCTGGCTGG TCGGCGCGAA CTTCATCCCG
GCCACCGCGA TCAACCAGCT CGAGATGTTC CAGCCGGGTA CCTTCGACCC GCGACGCATC
GACAGCGAGC TGCGGACGGC AAAGCTGATC GGGCTCAACA CCGTTCGGGT GTTCCTGCAC
GATCTGCTCT GGGTGCAGGA CCGGGTCGGC TTCCAGCGCC GACTGGCCCG GTTCGTCGAC
ATCGCGGCCC ACCACGGCAT CAAACCGCTG TTCGTGCTGT TCGACTCGTG CTGGGATCCG
CACCCTCGAC TGGGTAAGCA ACGCGACCCG ATCCCGGGCG TGCACAACTC GGGCTGGGTG
CAGAGCCCGG GCGCCGAGCA CCTCAGCGAC CCGCGCCACC GCCGGGTCCT GCGGGACTAC
GTGGTCGGTG TGCTGAGCCA GTTCCGTCAC GACAAGCGGG TGCTCGGCTG GGACCTGTGG
AACGAACCCG ACAATCCCGC CGACGCCTAC AAGGACGTCG AGCGCAGGGA CAAGGTGGAT
CGGGTGGCCG AGTTGCTGCC GCAGGTCTTT CAGTGGGCCA GGTCGGTCGA CCCCGTGCAG
CCGCTGACCA GTGGCGTCTG GGACGGCGAG TGGGGAGATC CGGCGCGCCG CAACGAGATC
AACCGGATCC AGCTCGATCT CTCCGACGTG ATCACCTTCC ACAGCTATGC CGATCGGAGG
GGGTTCGAGG CGCGGCTGGA GGAGCTCACC CCCATCGGGC GGCCGATGTT GTGCACCGAG
TACATGGCGC GCACGCTGGA CAGCACGGTG GAGACGATTC TGCCGATCAC CAGGCGCCGC
AACGTCGGGG CCTACACGTG GGGATTCTTC GCGGGCAAGA CGCAGACCTT CCTGCCGTGG
GATTCGTGGG ACCGTCCGGT GACCGGCCCG CCCGGGCTGT GGTTCCACGA CCTGCTCAAC
GGCGACGGCA GCCCGTACCG GGACAGCGAG ATCAACACCA TCCGCGAGCT CACCGGCAGG
CGAGGACCTT AG
 
Protein sequence
MPPEPLLNRR AALKVPAVLA AGMALSTVPR ASAELTRWSP DRAHRWHRAQ GWLVGANFIP 
ATAINQLEMF QPGTFDPRRI DSELRTAKLI GLNTVRVFLH DLLWVQDRVG FQRRLARFVD
IAAHHGIKPL FVLFDSCWDP HPRLGKQRDP IPGVHNSGWV QSPGAEHLSD PRHRRVLRDY
VVGVLSQFRH DKRVLGWDLW NEPDNPADAY KDVERRDKVD RVAELLPQVF QWARSVDPVQ
PLTSGVWDGE WGDPARRNEI NRIQLDLSDV ITFHSYADRR GFEARLEELT PIGRPMLCTE
YMARTLDSTV ETILPITRRR NVGAYTWGFF AGKTQTFLPW DSWDRPVTGP PGLWFHDLLN
GDGSPYRDSE INTIRELTGR RGP
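
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. It assumes that translation table 11 uses the standard codon assignments, that an alternative start codon such as TTG is read as methionine at the first position, and that the start codon set below matches the NCBI definition of table 11. All function and constant names are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; table 11 is assumed to share them.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed start codons for translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the start codon as M and stopping at the first stop."""
    dna = clean_sequence(dna)
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS_TABLE_11:
        raise ValueError(f"{codons[0]} is not an assumed table 11 start codon")
    residues = ["M"]
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First seven codons of the gene sequence in this record.
print(translate_cds("TTGCCGCCCG AGCCTCTCCT G"))  # MPPEPLL
```

The output matches the first seven residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would allow a comparison with the GC content field.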