Gene Mvan_2642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_2642 
Symbol 
ID4643985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp2795902 
End bp2797149 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content68% 
IMG OID639806124 
Productaminodeoxychorismate lyase 
Protein accessionYP_953456 
Protein GI120403627 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGACG ATTGGCAGAC TGACCGTGCC GAGCCGCTTG CGGTCGGGCC GCCCCGGCGG 
CGCTTGACAC GGCGGGAGCG CGCCCGGGAG GAGCGCTACC GGCGCAGGCG CAAGACCGCC
CTCGTCGTCA CGCTGTCGAT GCTCGTCGTG GTCGTCCTCG TCGCGGTGTT CCTCGGGTCC
AAGATGTGGC ATTCGCTCTT CGGCGGCCCC GGCGACGACT TCGCCGGTGC GGGCGTCAAC
GACGTCGTGA TCCAGGTCCA CGACGGCGAC TCCACCACCG CGATCGGCCA GACTCTGCAC
GACAACAACG TGGTCGCCAA CGTCAAGGTC TTCGTCGAGG CGGCCGACGG GAACGCGGCC
ATCTCGGCCA TCCAGCCGGG GTTCTACAAG GTGCGCACCG AGATACCGGC AGCCGATGCG
GTGGACCGGC TCGCCGACCC GGGTAACAGG GTCGGCAAGC TCGTCATCCC CGAAGGCCGT
CAGCTCGACG ACGTGCGTGA CGTCAAGACC AATGCCGTCA CCGAGGGCAT CCTGACCCTG
ATCTCCAAGG CGTCCTGCGT GGACCTCGAC GGCGACCGCC ACTGCGTGTC CGCAGACGAT
CTGAGGAACA CCGCAGCCGG CGCCGATCCG GCCGAGCTCG GCGTTCCCCA GTGGGCGACC
GAGCCCGTCG AGGCGCTCGG GTCGGACCTG CGCAGGCTGG AAGGTCTGAT CGCGCCGGGC
TCGTGGAACA TCGATCCGTC GGCCCAACCG CAGGACATCC TGTCGACGCT GATCTCGGCC
AGCGCGACAT TGTACGAACA GAATGGTCTG CTCGACGCGG CTGCGGCAGT GAACATGTCG
CCGTATCAGA TCCTCACGGT CGCGTCGCTG GTCCAGCGCG AAGCCACGCC GGAGGACTTC
TCGAAGGTGG CCCGGGTGAT CTACAACCGG CTTGCCGAGC GGCGCACGCT GGAGTTCGAC
TCCACGGTGA ACTACCCGCT GGACCGCATC GAAGTCGCCA CCACCGACGG CGATCGCGCC
CAGCTGACGC CATGGAACAC CTATGTGCGG CCGGGACTTC CGGCCACGCC GATCTGCTCT
CCCAGTCAGG CCGCGCTGGT GGCCGCCGAG AACCCGGAGC CGGGGGACTG GCTGTACTTC
GTGACGGTCG ACATGCAGGG CACGACGCTG TTCACCCGTG AGTACGAGCA GCACCTGGCC
AACATCGAGG TGGCGCAGCG CAACGGTGTC CTCGACTCGG CGCGGTGA
 
Protein sequence
MTDDWQTDRA EPLAVGPPRR RLTRRERARE ERYRRRRKTA LVVTLSMLVV VVLVAVFLGS 
KMWHSLFGGP GDDFAGAGVN DVVIQVHDGD STTAIGQTLH DNNVVANVKV FVEAADGNAA
ISAIQPGFYK VRTEIPAADA VDRLADPGNR VGKLVIPEGR QLDDVRDVKT NAVTEGILTL
ISKASCVDLD GDRHCVSADD LRNTAAGADP AELGVPQWAT EPVEALGSDL RRLEGLIAPG
SWNIDPSAQP QDILSTLISA SATLYEQNGL LDAAAAVNMS PYQILTVASL VQREATPEDF
SKVARVIYNR LAERRTLEFD STVNYPLDRI EVATTDGDRA QLTPWNTYVR PGLPATPICS
PSQAALVAAE NPEPGDWLYF VTVDMQGTTL FTREYEQHLA NIEVAQRNGV LDSAR