Gene Mflv_2971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_2971 
Symbol 
ID4974292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp3141243 
End bp3142649 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content70% 
IMG OID640457193 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001134236 
Protein GI145223558 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACTGGA CCGTCGACGT ACCCATCGAC CAGCTGCCCG CGCTCCCGCC GCTGCCCGCG 
GATCTGCGGC AGCGTCTCGA CGCCGCGCTG GCCAAGCCGG CGCTGCAGCA GCCGTCCTGG
GACGCCGGCC AGGCCGCCGC GATGCGCAAG GTGCTCGAGA GCGTGCCGCC GGTGACGGTG
CCGTCGGAGA TCGAGCGGCT CAAGGGGCAG CTGGCCGACG TCGCGCTGGG CAAGGCGTTC
CTGCTGCAGG GCGGCGACTG CGCCGAGACG TTCGTCGACA ACACCGAGCC GCACATCCGC
GCGAACATCC GCACGCTGCT GCAGATGGCC GTGGTGCTGA CCTACGGCGC GAGCATGCCG
GTGGTGAAGG TGGCCCGCAT CGCCGGGCAG TACGCCAAGC CGCGCTCGTC GGACGTCGAC
GCGCTGGGTC TGAAGTCCTA CCGCGGCGAC ATGGTCAACG GCTTCGCACC GGACGCCGCG
GTGCGCGACC ACGACCCGTC GCGACTGGTG CGCGCCTACG CCAACGCCAG CGCGGCGATG
AACCTGGTGC GCGCGCTGAC GTCGTCGGGG ATGGCGTCGC TGCACCAGGT GCACGATTGG
AACCGGGAGT TCGTCCGGAC CTCGCCCGCG GGTGCGCGCT ACGAGGCGCT GGCCGGCGAG
ATCGACCGCG GACTGCGGTT CATGAGTGCG TGCCGGGTCG ATGACCGCAA CCTCGACACC
GCCGAGATCT ACGCCAGCCA CGAGGCGCTG GTGCTCGACT ACGAGCGCGC GATGCTGCGC
ATGGAGACCG GTGACCTGGC CGACCCCGCT CTGTCGTCGG AGCCGAAGCT CTATGACCTG
TCGGCCCATT ACGTGTGGAT CGGGGAGCGC ACCCGTCAGC TCGACGGCGC GCACGTCGCG
TTCGCCGAGG TGATCGCCAA CCCGATCGGC ATCAAGATCG GGCCGACGAC CTCTCCCGAG
CTTGCGGTCG AGTACGTCGA GCGCCTGGAC CCGAACAACG AGCCGGGCCG GCTCACCCTG
GTCAGCCGGA TGGGTAACCA CAAGGTCCGC GACGTGCTGC CGCCGATCAT CGAGAAGGTG
CAGGCGTCGG GCCACCGCGT CATCTGGCAG TGCGACCCGA TGCACGGCAA CACCCACGAG
TCCTCGACGG GCTACAAGAC CCGCCACTTC GACCGCATCG TCGACGAGGT GCAGGGCTTC
TTCGAGGTGC ACCGGGCGCT GGGCACCCAC CCCGGCGGCA TCCACGTCGA GATCACCGGT
GAGAACGTCA CCGAGTGTCT CGGTGGGGCA CAGGACATCT CGGACACCGA CCTGGCGGGG
CGCTACGAGA CGGCGTGCGA TCCGCGGCTG AACACGCAGC AGTCGCTGGA GCTGGCGTTC
TTGGTCGCGG AGATGCTCCG CGATTAG
 
Protein sequence
MNWTVDVPID QLPALPPLPA DLRQRLDAAL AKPALQQPSW DAGQAAAMRK VLESVPPVTV 
PSEIERLKGQ LADVALGKAF LLQGGDCAET FVDNTEPHIR ANIRTLLQMA VVLTYGASMP
VVKVARIAGQ YAKPRSSDVD ALGLKSYRGD MVNGFAPDAA VRDHDPSRLV RAYANASAAM
NLVRALTSSG MASLHQVHDW NREFVRTSPA GARYEALAGE IDRGLRFMSA CRVDDRNLDT
AEIYASHEAL VLDYERAMLR METGDLADPA LSSEPKLYDL SAHYVWIGER TRQLDGAHVA
FAEVIANPIG IKIGPTTSPE LAVEYVERLD PNNEPGRLTL VSRMGNHKVR DVLPPIIEKV
QASGHRVIWQ CDPMHGNTHE SSTGYKTRHF DRIVDEVQGF FEVHRALGTH PGGIHVEITG
ENVTECLGGA QDISDTDLAG RYETACDPRL NTQQSLELAF LVAEMLRD