Gene Noca_3096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3096 
Symbol 
ID4597881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3297862 
End bp3299199 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content71% 
IMG OID639777702 
Product3-deoxy-D-arabinoheptulosonate-7-phosphate synthase 
Protein accessionYP_924285 
Protein GI119717320 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCACGA TCCCCGACCT CGCGACCCTG CACGCCATGG GCGCCGCCCA GCAGCCGTCG 
TACCCCGACC GCGCCGCCGT CGACTCGGCG GTCCAGCGGT TGCGGACGGC CCCGCCGCTG
GTGTTCGCCG GCGAGTGCGA CGACCTGAAG GGCAAGATCG CCCAGGTCGC GCGCGGCGAG
GCGTTCCTGC TGCAGGGGGG CGACTGCGCC GAGACGTTCG CCGGCGTGAC CGCCGACAAC
GTGCGCAACA AGCTGCGGGT GCTGCTGCAG ATGGCGGTCG TGCTGACCTA CGCCGCGTCC
GTGCCGGTCG TGAAGGTCGG CCGGATCGCC GGGCAGTACG CCAAGCCGCG CTCCTCGGAC
TTCGAGACCC GCGACGGGGT GACGCTGCCG GCCTACCGCG GCGACGCGGT CAACGGCTAC
GACTTCACCG CGGAGTCGCG GGTGCCCGAC CCGCAGCGGC TGGTCGACGT CTACAACTCG
TCGGCCGCGA CGCTGAACCT GGTGCGCGCG TTCGTGACCG GCGGCTACGC CGACCTGCGC
CAGGTGCACA CCTGGAACAC CGACTTCGTG CGGGAGTCGC CGGTCGGCCA GCAGTACGAG
GCGATGGCGA ACGAGATCGA GCGCGCGCTG ACCTTCATGC GGGCGATCGG CGCGGACCCC
GATGAGTTCC ACCGGGTCGA CTTCCACTCC AGCCACGAGG CGCTGGTCCT GGAGTACGAG
CAGGCCCTGA CGCGCATCGA CTCGCGCACG AGCACGCCGT ACGACGTCTC CGGGCACTTC
CTCTGGATCG GCGAGCGGAC CCGCCAGCTG GACGGCGCGC ACGTCGAGCT GCTGAGCCAC
ATCCGCAACC CGATCGGCGT GAAGCTCGGC CCGACGACCA CGCCTGACGA CGCGCTGGCG
CTCGCCGCGA AGCTCAACCC GGACAACGAG CCAGGCCGGC TCACGTTCAT CACCCGCTTC
GGTGCCGGCA AGATCCGCGA CGGCCTGCCG ACGCTGGTCG AGAAGGTCAC CGCCGCCGGG
CTCGAGGTCG CGTGGGTGTG CGACCCGATG CACGGCAACA CCTTCGAGGC GAGCTCGGGC
TACAAGACCC GCCGCTTCGG CGACGTGATC GACGAGGTGC AGGGCTTCTT CGACGTACAC
CGCTCGCTCG GCACCTGGCC CGGTGGCCTG CACGTCGAGC TCACCGGCGA CGACGTGACC
GAGTGCGTCG GCGGGGGCGA GGACCTGATG GAGGTCGACC TCGGCAACCG CTACGAGTCG
GTGTGCGACC CGCGGCTCAA CCGGGTCCAG TCGCTCGAGC TGGCGTTCCT CGTCGCGGAG
ATGCTCCGGC AGGCCTGA
 
Protein sequence
MSTIPDLATL HAMGAAQQPS YPDRAAVDSA VQRLRTAPPL VFAGECDDLK GKIAQVARGE 
AFLLQGGDCA ETFAGVTADN VRNKLRVLLQ MAVVLTYAAS VPVVKVGRIA GQYAKPRSSD
FETRDGVTLP AYRGDAVNGY DFTAESRVPD PQRLVDVYNS SAATLNLVRA FVTGGYADLR
QVHTWNTDFV RESPVGQQYE AMANEIERAL TFMRAIGADP DEFHRVDFHS SHEALVLEYE
QALTRIDSRT STPYDVSGHF LWIGERTRQL DGAHVELLSH IRNPIGVKLG PTTTPDDALA
LAAKLNPDNE PGRLTFITRF GAGKIRDGLP TLVEKVTAAG LEVAWVCDPM HGNTFEASSG
YKTRRFGDVI DEVQGFFDVH RSLGTWPGGL HVELTGDDVT ECVGGGEDLM EVDLGNRYES
VCDPRLNRVQ SLELAFLVAE MLRQA