Gene Ndas_1973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1973 
Symbol 
ID9245823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2396558 
End bp2397724 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content73% 
IMG OID 
Productaminotransferase class V 
Protein accessionYP_003679906 
Protein GI297560932 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000551025 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000445037 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGGCGTGG ATGTGGCTGC GGTGCGGGCG GACACCTCTG GGGTTGGGCA GGCGGCGCAT 
CTGGACAACG CGGGGTCGTC GTTGCCGCCG GACGTGGTGG TGGAGGCGGT GGTGGATCAC
CTGCGCCTGG AGGCGCGGGT GGGGGGTTAC GCGGCGGCCG AGCTGGCGCA GGCGCGGGTG
GAGGGGTTCT ACACCGCCGT GGCGCGGTTG GTGGGTGCGC GCCCGCAGGA GATCGCGTTC
ACCGAGAGTG CGACGCGGGC GTGGGAGTTG GCGTTCGGGT CGGTGGTCTT CGCCGAGGGG
GACCGGGTGC TGACCACGGC CAGTGAGTAC CCCAGTAACG CGTTGGGGAT GGTCAAGGCG
GCCCGTGAGC GGGGTGTGCG GGTGCAGGTG GTGCCCGACG ACGCCGACGG GGTGATGGAC
GTGGGGGCGC TGGAGCGGGA GTTGGCGCGG GGCGGGGTGC GGGTGGTGGC GATCAACCAC
ATGCCCACGC ACAACGGGCT GGTCAACCCG GCCGAGCGGA TCGGGGCGCT GTGCCGCAGG
TTCGGGGTGT TGTTCGTGTT GGACGCGTGC CAGTCGGCGG GCCAGTGGGA TCTGGACGTG
GAGCGGCTGG GGTGTGACGT GTTGGCGGTG ACGGGGCGCA AGTTCCTGCG CGGGCCGCGG
GGGACGGGGT TCGTGTACGT GCGGTCGGGG GCGGACCTGG GGGAGCCGCC GGTGGTGGAC
GTGACGTCGG CGCACTGGGA GGGGCGCGGG TACCGGGTGC GTGAGGACGC GCGCCGGTTG
GAGAGCTTCG AGCGCAACGT GGCCGGTCAG ATCGGGTTGG GTGTGGCCGT GGACTACGCG
TTGGCGGTGG GGATGGAGCC CATCCGGGAG CGGGTGGGGG CGCTGGCCGA GCAGGCCCGG
GACCGGTTGG GGCGTCTGGC GGGGGTGCGG GTGCTGGACC GGGGGAGGGT GCGGTCGGGG
ATCGTGACGT TCGCGGTGGA GGGTGTGGCG GCAGAGGGGG TGCGTGCGGC GCTGGGGGAG
GCCGGGGTGC GGGTGAGCGT GTCGCGGTTG TGGAACCAGG TGTGGGAGCC CGGTGTGGGG
GTGGACGAGG CGGTGCGTGC CTCGGTGCAC TACTTCAACA CCGAGGCCGA GGTGGAGGCG
TTGGCGGACG CGGTCAGGGG GTTGTAG
 
Protein sequence
MGVDVAAVRA DTSGVGQAAH LDNAGSSLPP DVVVEAVVDH LRLEARVGGY AAAELAQARV 
EGFYTAVARL VGARPQEIAF TESATRAWEL AFGSVVFAEG DRVLTTASEY PSNALGMVKA
ARERGVRVQV VPDDADGVMD VGALERELAR GGVRVVAINH MPTHNGLVNP AERIGALCRR
FGVLFVLDAC QSAGQWDLDV ERLGCDVLAV TGRKFLRGPR GTGFVYVRSG ADLGEPPVVD
VTSAHWEGRG YRVREDARRL ESFERNVAGQ IGLGVAVDYA LAVGMEPIRE RVGALAEQAR
DRLGRLAGVR VLDRGRVRSG IVTFAVEGVA AEGVRAALGE AGVRVSVSRL WNQVWEPGVG
VDEAVRASVH YFNTEAEVEA LADAVRGL