Gene Ndas_3373 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3373 
Symbol 
ID9247238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4031351 
End bp4032571 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content72% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003681284 
Protein GI297562310 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGGGGC CGCTCTCCCT GGCCCACGCG TTACAGCGCC TGGCGAGGAT CCCGTCCATG 
CGGCGCAAGT ACGGCCGTTA CCTCGTGCTG GGGCGCAGGC CCCGCCTGGT GGACCTGCCT
CCCCTCACCC GTGAGGAGCT CGGCGAGGCC GTCGACACGA TGATGCGGGA GTCCCCGGCC
GAACTCTCCC GGGCCTCGCT CCACCTCATG GGGGGAACGA CGTCCACGAG CCGGTTGGGC
GCGCTCCCCT CCGACCTCCA CGTGGACGAG ATCGCCCCGC ACCTTCAGCC CTTCGCACCG
GGGGACCTGG TCGCGAGCCT GAGCACCCCG TTCCACATGC GCGCGTCGCA CGACCTGCAC
AACGCGCTCG CCGCACGGGC CGGCGTCCCC ACCCTGTCGC TCGACGCCCC GACGGACCAG
ATGATCGAGC CGTGCCTCGA CCTCTTCGAG CGGCACGGGG TGAGCGCCCT CGCCGCCACC
CTGGACACCG TCCAGCGCGT CCTGCGCTTC TGCGCCGCGT CCGGACGGCG CCTGGACTTC
CTGCGCAAGG TGCTCTGGAG TGGACCGGCG ATGGACGCGG GGACGCGCTC GCTGATCCGC
ACGCACTTCC CCCACCTGCG CACATGGGCG CTGTTCGGCT CCGCGGAGAC CTGGATCATC
GGGCACAGCG GACCCGACTG CGCGATCGAC ACCTTCCACC CGCTCCCCTA CCAGCACACC
GAGGTCGTCA ACGGGCGCAT GCTCGTGACC GTCACGCACA GGAAGGCGGT CGTCCCCCTG
TTGCGCTACG ACACCGGCAC CGGGGCCGAG TGGACGTCGT GCACCTGCGA GCTGCCCGGC
CGGCCCCTGC GGACCAACAG CCGCATAGAC GCGCCCTACG GTCCGCTCAG CAGTCTGGTG
TCCCCCGCCG ACCTGGCGTC GCTCGCCCTC CAACTCGACT CGGTGGAAGC CGCCCAGGTG
GTCCTGATCC GTCCGCACAC CGAGAACGAG CGGCTGCGCC TGCGGGTCCG GCTCCGCCCC
GGGACGGAGC CGGACCTGTA CACCGTCGAG TGGATCCGGC ACCACGTGGT GTCGGGCTGC
CTGGCCCTGG CGGAGGTCAT CGAGGAGGCC CCCGAGACCT TCGAGGTGAC CCTCTCCCGG
CGGCTGCTGG ACCAGTCCTC GGACGGCTCG GCGCCGACGA TGGTGGTCCG CACCGCCTCC
CGGGGCCGCT GCTCCGCGTA A
 
Protein sequence
MEGPLSLAHA LQRLARIPSM RRKYGRYLVL GRRPRLVDLP PLTREELGEA VDTMMRESPA 
ELSRASLHLM GGTTSTSRLG ALPSDLHVDE IAPHLQPFAP GDLVASLSTP FHMRASHDLH
NALAARAGVP TLSLDAPTDQ MIEPCLDLFE RHGVSALAAT LDTVQRVLRF CAASGRRLDF
LRKVLWSGPA MDAGTRSLIR THFPHLRTWA LFGSAETWII GHSGPDCAID TFHPLPYQHT
EVVNGRMLVT VTHRKAVVPL LRYDTGTGAE WTSCTCELPG RPLRTNSRID APYGPLSSLV
SPADLASLAL QLDSVEAAQV VLIRPHTENE RLRLRVRLRP GTEPDLYTVE WIRHHVVSGC
LALAEVIEEA PETFEVTLSR RLLDQSSDGS APTMVVRTAS RGRCSA