Gene Ndas_1175 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1175 
Symbol 
ID9245025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1431003 
End bp1432112 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content77% 
IMG OID 
ProductPrephenate dehydrogenase 
Protein accessionYP_003679122 
Protein GI297560148 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.349516 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCCGCA CCATGGCGGT GGTCGGCACG GGACTCATCG GAACGTCCGT GGCGCTGGCC 
GCGGGACGGC ACGGGGTCGC CGTCCACCTG ATGGACCGGG ACCCCGCCGC GGCCCGCACC
GCCGCCGCGC TGGGCGCCGG AACGGTCGGC GCCCCGGCCG AGGCGGTGGA CCTGGCCGTG
ATCGCGGTGC CGCCCAGCAT GGTCGGCGCC GTCCTGGCCG AGCAGCAGCT GCGCGGCCTG
GCCCGGGCCT ACACGGACGT GGCCAGCGTG AAGTCCGCGC CCGGCCGGGA CGTGCTGAGC
GCCATCGCGG ACCCGGCGAC GTTCATCGGC GGCCACCCCC TGGCCGGCCG GGAGCGGGCG
GGTCCCCTCG CGGCCCGCGC GGACCTGTTC GAGGGCCGCA CCTGGGTTCT CACACCGACG
GCGGCCACCG CGCGACCAGT GCTCAACCGG GCGCTGGAGA TGATCTGCCT GTGCGGCGCG
GTCCCGGTGA TGATGGACAG CCAGGCCCAC GACGACGCGG TGGCGCTGAC CTCGCACGCA
CCGCACGTGG TGGCGAGCCT CATGGCGGCG CGGTTGCGCG GCGGGGCCGA GGAGGCCTTC
CGCCTGGCCG GGCAGGGGTT GCGCGACACC ACCCGCGTCG CGGGCGGCGA CCCCCGGCTG
TGGACCGACA TCCTGCGCGC CAACTCCGGG CCGCTGGTCG GGGTGCTGCG CGACCTGCAC
GAGGACCTGT CACTGGTGCT GGCCTCCCTG GACGTGCTCT CCCGCTCCGG TCCGGGGCAG
GGCGCGCGCG AGACGGGCCG GGTGCGCGAC CTGCTGGACC GGGGTTCCCA GGGCCTGGGA
CTGCTCCGCG AGCAGCCGCC GGGCGGGGCG CGTCTGCGGG TGGCGGTGGA GGAGGCCCCC
GGAGAGCTGG CACGGCTGCT GGCGGTGCTG GACGAGTCCG GCGTCACCGC CGACGACGTG
TCCGCCTCCT GGGACCAGGA CACCCTGACG GCGGAGTTCG CGGCACCGGC CACCGCCGCC
GGGCCGCTGC TGAGGCGGCT GGGGGCGGAG GGCTGGACGG CCGGGTACGC GGACCTGGCG
ACGGACTCCG AGGTCGGCGC CCTGCGCTGA
 
Protein sequence
MIRTMAVVGT GLIGTSVALA AGRHGVAVHL MDRDPAAART AAALGAGTVG APAEAVDLAV 
IAVPPSMVGA VLAEQQLRGL ARAYTDVASV KSAPGRDVLS AIADPATFIG GHPLAGRERA
GPLAARADLF EGRTWVLTPT AATARPVLNR ALEMICLCGA VPVMMDSQAH DDAVALTSHA
PHVVASLMAA RLRGGAEEAF RLAGQGLRDT TRVAGGDPRL WTDILRANSG PLVGVLRDLH
EDLSLVLASL DVLSRSGPGQ GARETGRVRD LLDRGSQGLG LLREQPPGGA RLRVAVEEAP
GELARLLAVL DESGVTADDV SASWDQDTLT AEFAAPATAA GPLLRRLGAE GWTAGYADLA
TDSEVGALR