Gene Mvan_1054 details

Gene Information

Locus tag: Mvan_1054
Symbol:
ID: 4645365
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 1107057
End bp: 1108547
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 70%
IMG OID: 639804555
Product: glycosidase, PH1107-related
Protein accession: YP_951898
Protein GI: 120402069
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2152] Predicted glycosylase
TIGRFAM ID:
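
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 1107057
END_BP = 1108547


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1491
print(protein_length(length_bp))  # 496
```

Both results agree with the Gene Length and Protein Length fields in the record.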


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATCCA CCGGGATCCA GCTCGTGACC CGCAGCCCGC AGCGGGTTGC GGCCGACCCG 
GGCCGCGTGG TCACCCGGCT GTTCGTGCCG GGCCACGAGG GGTTCGAGCA TCAGGAGTCC
CGTGCGGGGG CGGTGCTGTC GCGGATCCTG GCCCTGACCG ACGACGAGGT GCGGGCGGCC
CTGCAGGATG TGCTGATCCG CTTCGACGGG CGTCACCGTG ATCTGACCGG CACCTTCCGC
CGGCACGCCA GGGAGCTCGC CGACCGGCTC GACCCCACCC GGGAGTTCAC CGAGGCGCGC
GTGCTGCTGC TGGGGGCCAC GTTCACCAAC GAGTACTCGA TCGAGGGTGC GGCACTGTGC
AATCCGAGCG TCGTCGCGCA CCCCGACCAG TCCGGCACCG TCCCGGGCAG CCTGCGGTTC
GTGATGAGCG TCCGGGGGAT CGGGGAGGGA CACCGTTCGA GCATCGGGTT CCGGACGGGC
GTCGTCGACT CGACGGGTCA CGCCACGATC GACGAGCCTG CTCCGTTCGC CTCCACCGGA
CGGGTCGAGC CCACCCTGCT GGACGCCGCC GTCTTCCGCA CCGAACTCCG TGACAAGGGC
TGCGGCGGCG AGGCCGCCGA CTACGTCTTC GATGCGCTCG GTGCGCTGTT CACCAGGTCC
GACCTGGACG AGCGGCTCGA AAGACTGCGC GCCCACCTGA GCACACGCGG ACATGTCGAG
GACACGATCG CGACCATCCG CGGTGTCGCC GCTCGCTGTT ACGCGGTCGA GTTCCCGGAT
GACACAACAC TTTCCGAGCG GGTGCTGTGG CCGGAGATGG AGGCCGAACA CGCCGGCATG
GAGGACGCCC GCTTCGTGCG TTTCGTCGAC GACGACGGTT CGATCCGCTA CCACGCGACG
TACACCGCCT ACAGCGGATC GCACATCAGC CAGCAACTGC TCACCACCGC GGACTTCCAG
ACCTTCACCT CCGGGCCCCT CGTCGGGAGT GCCGCCGCCA ACAAGGGGCT GGCGTTGTTC
CCTCGCCGCA TCGGCGGCCG GTACGCCGCG ATGTCGAGGT CGGACCGCGA GACCAACACC
GTCGCCTTCG CCGATGATCT GTCGGTCTGG ACCACGGCGT TGCCCTGCCA ACAGCCGGCC
GAGGTGTGGG AGACGCTGCA ACTCGGAAAC TGCGGTCCGC CGATCGAGAC CGACAGGGGC
TGGCTGCTGT TGACCCACGG CGTCGGGCCG ATGCGCACGT ACAGCATCGG GGCGATCCTG
CTTGACCTCG ACGATCCGAC CCGGGTGATC GGACGACTGC GACGGCCCCT GCTGACCCCG
GCGGCCGACG ACCGGGACGG GTATGTGCCC AACGTGGTGT ACTCGTGCGG CGCGCTCGTC
CACGCGGACA CCCTGGTGAT CCCGTACGGG ATCTGCGACA GCGCCATCGG TCTCGCGACG
GTCCCGCTCC CGGACCTGCT GGCCGAGCTC GCCGGGTCGC CTCGGCACTG A
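
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be cleaned before any computation such as GC content. A minimal sketch follows, assuming the sequence is pasted in as a plain string. The helper names are hypothetical, and the sample string is only the first two blocks of the sequence, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two ten-base blocks of the gene sequence, as an illustration only.
sample = clean_sequence("ATGACATCCA CCGGGATCCA")
print(len(sample))                 # 20
print(gc_fraction(sample))         # 0.55
```

Applying the same two functions to the full gene sequence gives a value that can be compared with the GC content field in the record.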
 
Protein sequence
MTSTGIQLVT RSPQRVAADP GRVVTRLFVP GHEGFEHQES RAGAVLSRIL ALTDDEVRAA 
LQDVLIRFDG RHRDLTGTFR RHARELADRL DPTREFTEAR VLLLGATFTN EYSIEGAALC
NPSVVAHPDQ SGTVPGSLRF VMSVRGIGEG HRSSIGFRTG VVDSTGHATI DEPAPFASTG
RVEPTLLDAA VFRTELRDKG CGGEAADYVF DALGALFTRS DLDERLERLR AHLSTRGHVE
DTIATIRGVA ARCYAVEFPD DTTLSERVLW PEMEAEHAGM EDARFVRFVD DDGSIRYHAT
YTAYSGSHIS QQLLTTADFQ TFTSGPLVGS AAANKGLALF PRRIGGRYAA MSRSDRETNT
VAFADDLSVW TTALPCQQPA EVWETLQLGN CGPPIETDRG WLLLTHGVGP MRTYSIGAIL
LDLDDPTRVI GRLRRPLLTP AADDRDGYVP NVVYSCGALV HADTLVIPYG ICDSAIGLAT
VPLPDLLAEL AGSPRH
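
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions, not the database's own method. It builds the codon table from the standard 64-letter amino acid string, which translation table 11 shares with the standard code for codon-to-residue assignments (the tables differ in permitted start codons). The function names are hypothetical, and the sample inputs are the first three and the last two blocks of the gene sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def translate(raw: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean_sequence(raw)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGACATCCA CCGGGATCCA GCTCGTGACC"))  # MTSTGIQLVT
print(translate("CGG CAC TGA"))                       # RH
```

The first call reproduces the first block of the protein sequence, and the second shows that the final codons of the gene encode the terminal residues RH followed by a stop.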