Gene Noca_2414 details


Gene Information

Locus tag: Noca_2414
Symbol: aroB
ID: 4598250
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 2574006
End bp: 2575103
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 72%
IMG OID: 639777017
Product: 3-dehydroquinate synthase
Protein accession: YP_923606
Protein GI: 119716641
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
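
The listed gene and protein lengths can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon (the gene sequence in this record ends in TGA). The variable names are illustrative only.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 2574006
end_bp = 2575103

# Inclusive coordinates: both endpoints count as bases of the gene.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be a stop codon
# and therefore contributes no amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp;", protein_length, "aa")
```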


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGGACA CCGTCGTCCG GGTGACCGGG GCCGCGCCGT ACGACGTCGT CATCGGCCAC 
GACCTGTCCG AGCGGCTGCC GCAGCTGCTC GGGGTCGGGG TGCAACGGGT CGCGGTCGTG
TTCTCCGACG CGTTGGCCGA GCTGGTGAAC CCCGTCCTCG ACTCGCTGGC GGCGGCGTAC
GACGTGATGG TGCTCCCGAT CCCCGACGGC GAACGGGCGA AGAAGGCTGC GGTCGCGGTC
TCGTGCTGGG AGGCGCTCGG TGAGGCCGGG TTCACCCGCT CCGACGCGGT CGTGACCTTC
GGGGGCGGTG CGACCACGGA CGTCGGTGGC TTCGTCGCCG CGACCTGGCT GCGCGGCGTG
AAGGTCGTGC ACGTGCCGAC CACGCTGCTC GGCATGGTCG ACGCCGCGGT CGGCGGCAAG
ACCGGCGTGA ACACCCGCAG CGGCAAGAAC CTGGTGGGTG CCTTCCACGA GCCGGCCGGC
GTGCTCTGCG ACCTGTCGAC GCTGCGCTCC CTGCCCCGCG CCGAGCTGCT CGCCGGGCTC
GGCGAGGTCA TCAAGTGCGG GTTCATCGCC GACCCCGCGA TCCTCAACCT CGTCGAGGGC
AACGAGGCCT CCCGCCTCGA TGCCGACTCC CTCGTCCTGC GCGAGCTGGT CGAGCGTGCC
GTGCGGGTGA AGGCCGGGGT CGTCTCGGCC GACCTCAAGG AGACCGGCGG CCGCCCGGAC
GACCCGGGCC GGGAGATCCT CAACTACGGG CACACGATGG CGCACGCCAT CGAGCGCACC
GAGAACTACC GGATCCGGCA CGGCGAGGCC GTCTCGATCG GGTGCGTGTA CGTCGCGGAG
CTGGCCAACC GGGCCGGCAC CCTCGCCTCC GACATCGTCG AGCGGCACCG GCACGCGTTC
GCCCGCGTCG GGCTGCCCAC GTCGTACTCC AAGGCCTCCT TCGACGACCT GCACCGGGCG
ATGCGGGTCG ACAAGAAGGC GCGCGGCTCC CAGCTGCGCT TCATCGTGCT CTCCGACCTC
GCGGTCCCGA CCGTGCTGGC CGGACCGTCC GTGGTGGACC TGCGCGACGC CTACGCCGCG
ATCGGTGGCC CCGCGTGA
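
The GC content field can be recomputed from a sequence laid out as above. This is a minimal sketch, assuming the sequence is pasted as a plain string in space-separated blocks of ten bases; `gc_content` is a hypothetical helper name, not part of any database tool. Only the first line of the gene sequence is used here for brevity; pass the full sequence to compare against the value listed in the record.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    # Remove the spaces and line breaks used for display formatting.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First display line of the gene sequence from this record.
first_line = "ATGAGGGACA CCGTCGTCCG GGTGACCGGG GCCGCGCCGT ACGACGTCGT CATCGGCCAC"
print(f"{gc_content(first_line):.1f}% GC over the first 60 bases")
```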
 
Protein sequence
MRDTVVRVTG AAPYDVVIGH DLSERLPQLL GVGVQRVAVV FSDALAELVN PVLDSLAAAY 
DVMVLPIPDG ERAKKAAVAV SCWEALGEAG FTRSDAVVTF GGGATTDVGG FVAATWLRGV
KVVHVPTTLL GMVDAAVGGK TGVNTRSGKN LVGAFHEPAG VLCDLSTLRS LPRAELLAGL
GEVIKCGFIA DPAILNLVEG NEASRLDADS LVLRELVERA VRVKAGVVSA DLKETGGRPD
DPGREILNYG HTMAHAIERT ENYRIRHGEA VSIGCVYVAE LANRAGTLAS DIVERHRHAF
ARVGLPTSYS KASFDDLHRA MRVDKKARGS QLRFIVLSDL AVPTVLAGPS VVDLRDAYAA
IGGPA
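
The protein sequence above can be derived from the gene sequence by codon translation. The sketch below builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard table and differs in which codons may act as start codons. That difference is not modelled here, which does not affect this gene because it starts with ATG. `translate` is a hypothetical helper written for illustration.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first, second, third base
# each cycling through T, C, A, G); '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove display whitespace and normalise case.
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
print(translate("ATGAGGGACA CCGTCGTCCG GGTGACCGGG"))
```

Applied to the full gene sequence, the result can be compared with the listed protein sequence and the Protein Length field.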