Gene Noca_3298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3298 
Symbol 
ID4598170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3503791 
End bp3505206 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content71% 
IMG OID639777904 
Product3-isopropylmalate dehydratase, large subunit 
Protein accessionYP_924487 
Protein GI119717522 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.405916 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCAGGA CCTTGGCCGA GAAGGTCTGG GACGAGCATG TCGTCCGGTC GACACCGGGG 
GAGCCGGACC TCCTCTACAT CGACCTGCAC CTGATCCACG AGGTCACCTC CCCGCAGGCC
TTCGACGGCC TCCGGCTCGC CGGCCGCACC GTGCGTCGCC CGGACCTCAC GCTGGCCACC
GAGGACCACA ACGTCCCCAC CCTCGACTGG GACAAGCCCA TCGCCGACCC GGTCTCGAAG
ACCCAGGTCG ACACGCTGCG CCGCAACGCC GCGGAGTTCG GAGTCCGGCT GCACCCCCTC
GGCGACGTCG AGCAGGGCAT CGTGCACGTC GTCGGCCCGC AGCTCGGGCT GACCCAGCCC
GGGATGACGA TCGTGTGCGG CGACAGCCAC ACCAGCACGC ACGGAGCATT CGGCGCGATC
GCGTTCGGGA TCGGCACCTC GGAGGTCGAG CACGTCCTCG CCACGCAGAC GCTGCCGCAG
GCCAAGCCGA AGACGATGGC CGTCACGGTC GAGGGCAGCC TGCCCGACGG CGTGACCGCG
AAGGACCTGG TGCTGACCCT GATCGCCCAC ACGGGCACCG GTGGCGGGCA GGGCTACATC
GTCGAGTACC GCGGCCCGGC CATCGAGGAG CTCTCGATGG AGGGCCGGAT GACCGTCTGC
AACATGTCCA TCGAGTGGGG GGCCAAGGCC GGCCTGATCG CCCCCGACCA GACGACGTTC
GACTACATCG AGGGCCGGCC GGAGGCGCCG AAGGGCGCCG ACTGGGACGC CGCCGTTGCG
CACTGGAAGA CGCTGGTCAC CGACGCGGAC GCGACGTTCG ACAAGGAGAT CGTGCTCGAC
GCGAGCACGA TGACGCCGTT CGTCACCTGG GGCACCAATC CCGGCCAGGG CGTGCCGCTC
GGGGGCAGCG TTCCGGACCC GGCGCAGTAC GACGACCCCT CGGACCGGAT CGCCGCTGAG
AAGGCATGCG AGTACATGGG CCTCGAGGCC GGCACGCCGA TGCGCGACAT CAAGGTCGAC
ACCGTCTTCA TCGGCTCGTG CACCAACGGC CGGATCGAGG ACCTGCGCGC GGCCGCGGAG
ATCATCAAGG GCCGCCAGGT CGACAAGTCC ACCCGGCTGC TCGTCGTACC GGGCTCGGTG
CGCGTGCGTC TCCAGGCCCA GGACGAGGGC CTCGACGTGA TCTTCAAGGA GGCCGGCGGC
GAGTGGCGCG GCGCGGGCTG CTCGATGTGC CTGGGCATGA ACCCCGACAC CCTGCAGCCC
GGTGAGCGCA GCGCCTCGAC GTCCAACCGC AACTTCGAGG GCCGCCAGGG CAAGGGCGGC
CGCACCCACC TCGTGTCGGT GCCGGTCGCC GCCGCGACCG CGATCCGCGG CACCCTGTCC
TCGCCCGCCG ACCTCGAGCC CGTTGGGAGC AACTGA
 
Protein sequence
MGRTLAEKVW DEHVVRSTPG EPDLLYIDLH LIHEVTSPQA FDGLRLAGRT VRRPDLTLAT 
EDHNVPTLDW DKPIADPVSK TQVDTLRRNA AEFGVRLHPL GDVEQGIVHV VGPQLGLTQP
GMTIVCGDSH TSTHGAFGAI AFGIGTSEVE HVLATQTLPQ AKPKTMAVTV EGSLPDGVTA
KDLVLTLIAH TGTGGGQGYI VEYRGPAIEE LSMEGRMTVC NMSIEWGAKA GLIAPDQTTF
DYIEGRPEAP KGADWDAAVA HWKTLVTDAD ATFDKEIVLD ASTMTPFVTW GTNPGQGVPL
GGSVPDPAQY DDPSDRIAAE KACEYMGLEA GTPMRDIKVD TVFIGSCTNG RIEDLRAAAE
IIKGRQVDKS TRLLVVPGSV RVRLQAQDEG LDVIFKEAGG EWRGAGCSMC LGMNPDTLQP
GERSASTSNR NFEGRQGKGG RTHLVSVPVA AATAIRGTLS SPADLEPVGS N