Gene Mvan_2333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_2333 
Symbol 
ID4646110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp2490417 
End bp2491715 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content67% 
IMG OID639805817 
ProductNLP/P60 protein 
Protein accessionYP_953153 
Protein GI120403324 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0791] Cell wall-associated hydrolases (invasion-associated proteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.554645 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0124113 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGGC CCGCTGCTGT GGCGCTCGCG ATCACTGTCA CCACGGTCGT CTGCGGAGTC 
GGCCCGGCCG CATCGGTGCG CGCCGAGCCG GAGGCACCGG ATGCCGGCAC CCTGGCCTCG
CTGATCGTCA GGATTGCTGA GGCCGATCAG TCGCTACACG ACCTCGACGC GCAGATCCAG
ACATTGCGGG AGGAGGTGAA CAAGACGATC GCCGATCTCG CCACGGCTCA CGAACACGCC
GCGCGGGCCG ACCTGGACGC CGAGGCCACC GCTGTGTCCC TGCACGATGC GGTTCGCGCC
GTCGATACCG CCCTGGAGCG GTTCGACCGA TTCGCGGTTG CGACCTACGT GCACGGTCCC
GGCATCTCGC TGGTGGCACC GGACGGGCCG GAGGATCTCC TGGGAAACGT GGCCTACGCC
AATGCACTGC ACATCAGTGC TGCCAACGCA GCTGAGGATC TGCAGCGAGC GCGGACCGAA
CAGACCAACA GAAACTCGGC CGCTCGGCAG GCCCGCCAAC GAGCCGATGC AAGTCTCAAT
TCGGCGCGAC GCCGCCAACT CGACGCCGTG GCCGCGCTCA GAACCGCGCA GGATCGGCTG
GCCGCGAAGC AAACCGAGTT CGCCGCTCTT GTCGACCAGC GTGATCATGC TGTCGCAGCG
GTGAATTCGA CGTCACTCAC TGTGACACGT CGGGACGAAC TCATACTCCA GCCGTCGATC
ACGAGGCTGC CCCGCGACCT CGGTGCGGTC ATCGATTCCC TCGTGGCGAT CGCACGCGAC
TCCACCAGTG CGACCGCCGA GATGGGCCGC GCGTTCCTCG CACAGGCGGG GGTTGGACCG
GACGCTGTGT CGATATCTGC CTCTGCGCCC ACCGGTGAGC TGCCACGGTC GGTGAGGTGG
CAGGCCTCCG AGTACGTTGT GGCGCGCGCG CTTTCGCAGC GTGGGGTGCC GTATTCGTGG
GGTGGCGGAG CCGCGACCGG ACCCAGCCGC GGTATCGACA CCGGCGCGGA CGTGGTCGGA
TTCGACTGCT CGGGTCTGAT TCTGTACGCG TTCGCCGGTG TGGGGATTGC GTTGCCCCAC
TACACCGGCC ACCAGTATCA GGCCGGCCGC CAGGTTCCCG TCACCCAGAT GCGTCGTGGC
GACGTGATCT TCTTCGGACC CGGCGGCAGC GAGCACGTCG CGCTCTATCT CGGCAACGGC
GTCATGCTCG AGGCGCCAAG ACCGGGGCAG TTCGTCCAGG TCTCACCCGT GCGGACGTCC
GGTATGACCC CGTACGCCGT GCGGTTCATC GAGTACTGA
 
Protein sequence
MARPAAVALA ITVTTVVCGV GPAASVRAEP EAPDAGTLAS LIVRIAEADQ SLHDLDAQIQ 
TLREEVNKTI ADLATAHEHA ARADLDAEAT AVSLHDAVRA VDTALERFDR FAVATYVHGP
GISLVAPDGP EDLLGNVAYA NALHISAANA AEDLQRARTE QTNRNSAARQ ARQRADASLN
SARRRQLDAV AALRTAQDRL AAKQTEFAAL VDQRDHAVAA VNSTSLTVTR RDELILQPSI
TRLPRDLGAV IDSLVAIARD STSATAEMGR AFLAQAGVGP DAVSISASAP TGELPRSVRW
QASEYVVARA LSQRGVPYSW GGGAATGPSR GIDTGADVVG FDCSGLILYA FAGVGIALPH
YTGHQYQAGR QVPVTQMRRG DVIFFGPGGS EHVALYLGNG VMLEAPRPGQ FVQVSPVRTS
GMTPYAVRFI EY