Gene Mvan_3035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3035 
Symbol 
ID4647217 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3199953 
End bp3201767 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content69% 
IMG OID639806513 
Producthypothetical protein 
Protein accessionYP_953844 
Protein GI120404015 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3653] N-acyl-D-aspartate/D-glutamate deacylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGGCGC TGTTCGATGA GCCTGTGACT TTCGACACGA TCATCCGCAA CGGCCGTTGG 
TTCGACGGCA CCGGCGCGCC GTCCGCGGTG CGCAACCTCG GCATCCGCGA CGGACACGTC
GTGTCCATCA GCGAGGACGA GCTGGACGCG ACCGGATGCC CGCAGGTGAT CGACGCCGTG
GGCAAGTGGG TCCTGCCGGG CATGCTCGAC ATCCACACGC ACTACGACAT CGAGGTGCTG
GGTGGTCCGT CGCTGGCCGA GTCACTGCGT CACGGCGTGA CCACGGTGCT CCTCGGATCG
TGCTCGCTGT CGACCGTCTA CGTCGACGGC GTCGATGCCG GCGACATCTT CGGCCGTGTC
GAGGCCATCC CCCGCGAGCA CGTCATCGCG GCCGTCGACA GACACAAGAC CTGGACCACC
GGCGAGGAGT ACATCGCGGC GCTGGAATCA CGCCCGCTCG GACCCAACGT GGCGGCATTC
ATCGGGCACT CGGACATGCG GACCGCCACC ATGGGGCTGG ACCGGGCAAC CCGCAAGGAC
GAACGGCCCA CCGGCAGCGA GCAGGCGCAG ATGGAGCGGA TGCTGACCGA GGCACTGCGC
GCCGGATTCG TCGGAATGTC CTCTCAGCAA CTGCTTTTCG ACAAATTGGA CGGCGAGGTG
TGCCGCTCCC GGACGTTGCC GTCGACGTAC GCGAAGCCCC GGGAGCTGCG CAGGCTGAAA
TCCCTGCTGC GCCGCTCCGG GCGGGTTCTG CAGTCCGGGC CCGACATCAC GAACCCGCTG
AACGTGGCCT CGCAGGTGTT CCAGTCGCTC GGCCTTGTCC GCAATCCTCT CAAGACCAGC
CTGCTCTCGG CGGCCGACGT CAAGTCCAAC CCGGTGGCGG TCAAGCTGCT GGGCCCGCTC
GCGCGGCTGG CGAACCGCCT CGGCGGCAAC TTCCGCTGGC AGCACCTTCC CGTGCCGTTC
GAGGTGTACG CCGACGGCAT CGACCTGGTC ATCTTCGAGG AGTTCGGCTC GGGTGCGGCC
GCGCTGCACC TGGCCGACGA GGTCGACCGC AACGCGCTGT TGCGCGACGA GTCCTACCGG
CGCCGGTTCC GTAAGGACTA CGACGCGAGG TTCGGGATGC GGGTCTGGCA CCGGGACTTC
TTCGACGCCC AGATCGTCGC CTGCCCCGAT GTGTCGGTGA TCGGCAAGTC GTTCGGCGAG
GTCGGCCGCG AGCGCGGCGG ACTGCATCCG GTGGACGCGT TCCTCGACCT CGTCCTCGAA
CACGGCAAGG GGCTGCGGTG GCGCACCACG ATCTCCAACC ACCGCCCCGA GGTGCTCAGG
AAGCTGGCCG CCGACCCCGG GATCCAGATG GGCTTCTCCG ACGCCGGTGC TCACCTGCGC
AACATGGCGT TCTACAACAT GGGCCTGCGG CTGCTGCGCC ATGTCCAGGA CGCGGCGCGG
TCCGGGCGGC CGTTCATGAC CGTCGAACAG GCCGTGCACC GGTTGACCGG TGAGCTGGCC
GACTGGTACC GCATCGACGC CGGACATCTG CGCCTCGGCG ACCGCGCCGA TCTGGTGATC
GTCGACCCGG AACGGCTCGA TGACACACTG GACCGCTACG CCGAGGAACC GGTGGAGCAG
TACGGCGGGC TGTCCCGGAT GGTGAACCGA AACGACGACA CCGTCGCCGC CGTGTTCGTC
GGCGGCCGGG CGGTGTTCCT GGACGGGAGA CCCACCCCGC TGGTCGGCGC GGAACGGACC
GGACGCTTCC TGCGGGCCGC CCACAAGGCG CCCGCACTCC TCGCACAGAA CGGAGCGCTC
GCCCATGCCG GTTGA
 
Protein sequence
MRALFDEPVT FDTIIRNGRW FDGTGAPSAV RNLGIRDGHV VSISEDELDA TGCPQVIDAV 
GKWVLPGMLD IHTHYDIEVL GGPSLAESLR HGVTTVLLGS CSLSTVYVDG VDAGDIFGRV
EAIPREHVIA AVDRHKTWTT GEEYIAALES RPLGPNVAAF IGHSDMRTAT MGLDRATRKD
ERPTGSEQAQ MERMLTEALR AGFVGMSSQQ LLFDKLDGEV CRSRTLPSTY AKPRELRRLK
SLLRRSGRVL QSGPDITNPL NVASQVFQSL GLVRNPLKTS LLSAADVKSN PVAVKLLGPL
ARLANRLGGN FRWQHLPVPF EVYADGIDLV IFEEFGSGAA ALHLADEVDR NALLRDESYR
RRFRKDYDAR FGMRVWHRDF FDAQIVACPD VSVIGKSFGE VGRERGGLHP VDAFLDLVLE
HGKGLRWRTT ISNHRPEVLR KLAADPGIQM GFSDAGAHLR NMAFYNMGLR LLRHVQDAAR
SGRPFMTVEQ AVHRLTGELA DWYRIDAGHL RLGDRADLVI VDPERLDDTL DRYAEEPVEQ
YGGLSRMVNR NDDTVAAVFV GGRAVFLDGR PTPLVGAERT GRFLRAAHKA PALLAQNGAL
AHAG