Gene Ndas_4790 details

Gene Information

Locus tag: Ndas_4790
Symbol:
ID: 9248673
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5677263
End bp: 5678732
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: Propeptide PepSY and peptidase M4
Protein accession: YP_003682680
Protein GI: 297563706
COG category:
COG ID:
TIGRFAM ID:
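
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based, inclusive positions on the replicon, and that Gene Length counts the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 5677263
end_bp = 5678732
gene_length_bp = 1470
protein_length_aa = 489


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1470
print(expected_protein_length(gene_length_bp))  # 489
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.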


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid HitchhikerNo: 
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTCCC ATGCCCCCGA CCGGCTCGAC TCCCCCGAGA CGCCTCCTCC GGACCGCCGC 
TCCCCCGTCT GGGCGGCCCT GCGCCAACTC CTCCTACGCC TGCACTTCTA CGCCGGAGTC
CTCGTCGCGC CGTTCATCGC CGTGGCCGCC CTCACCGGAC TGCTGTACGC GTACGCGCCG
CAGCTGGAGC AGGTGCTCTA CGCCGACCAG CTCCGCGTCA CCCCCGGCGA CAGCGCGCTG
CCCCTCCAGG AACAGGTGGA CACCGCGGTC GCGGCACGTC CCGAGGGAAC GCTGAGCGCC
GTGCGCCCGG GCACGGCCCC CGACGAGAGC ACCCGCGTCC TGCTGGACGT GGAGGGGCTC
CCCGAGAGCT ACCGGCTCGC CGTCTTCGTC GACCCCTACA ACGGCGAGGT CCTCGGCGAG
ACGACCAGCT ACGGCAGCAG CGGCGCGCTG CCCGTGCGGT CGTGGCTGTC CGAACTCCAC
CGTCACCTGC ACCTGGGTGA GTTCGGCCGC CTGTACGGCG AACTCGCCGC GAGCTGGCTG
TGGGTGGTCG CTCTGGGCGG GGTCGTGCTG TGGACCGCGC GCCGGCGCAA GGCCCGCCGC
CTGCGCCGCA CCCTGCTGCC CGAGCCCTCC GCCAAGGGGC GGAACCGGAC CATGTCCTGG
CACGGGTCGC TCGGCATCTG GGCCGTAGCC GGCCTGCTGA TGGTGTCCGC CACCGGACTG
ACCTGGTCCC GGTTCGCCGG GGCCAACGTG ACCGACATGC GGGTGGCCCT CGGCTGGACC
ACGCCGTCGC TGTCGGCGCC CGCCCCGTCG GAGCACGCGG ACCACGAGGA GCAGGCGGAC
CACGCGGGGC ACGAGGACCA CGGCGGGCAC GGCGACCACG CGGACCACGG CGCGGCCGGA
CAGGACGTGG ACCTGGACAC GGTCCTGGCG TCCGTGCACG CCGCGGGGAT CGACGGCCCG
ATGGAGATCT CGGTGCCCGT GGAGGAGGGC GCGCCCTTCA CCGTCCAGGG AACCGGGCGG
AGCTGGCCCG TGCACCAGGA CGCGGCGGCC GTGGACGCGA CGAGCGGCGA GGTGGTCGAG
GAGCTGCGGT TCGAGGACCA TCCCTTCGTC GCCAAGATGG CCACGTGGGG GATCGCCTTC
CACATGGGTC TGCTGTTCGG CCTGCCGAAC CAGCTCCTGC TGACGGGGAT CGCCCTGTCC
GCGCTCTTCC TCGTGTTCTG GGGCTATCGG ATGTGGTGGC TGCGCCGCCC GACCCGGGAC
ACCGCGTTCG CCATGGGACG GCCGCTCGCC CCGCGCGGGA CCTGGCGGGG GCTGCCGTGG
TGGTGCCTGG CCCTGGTCGC CGCGGCGGCG GTGGGCGTCG GCCTGTTCCT GCCGGTGTTC
GGGGTCTCCC TGCTGGCCTT CCTCGTGGTG GACGCGGCCC TGGGCCTGCG GCGCGGGCGC
ACGCGCCCGG AGTCCGTCCC GAAGCCGTGA
 
Protein sequence
MTSHAPDRLD SPETPPPDRR SPVWAALRQL LLRLHFYAGV LVAPFIAVAA LTGLLYAYAP 
QLEQVLYADQ LRVTPGDSAL PLQEQVDTAV AARPEGTLSA VRPGTAPDES TRVLLDVEGL
PESYRLAVFV DPYNGEVLGE TTSYGSSGAL PVRSWLSELH RHLHLGEFGR LYGELAASWL
WVVALGGVVL WTARRRKARR LRRTLLPEPS AKGRNRTMSW HGSLGIWAVA GLLMVSATGL
TWSRFAGANV TDMRVALGWT TPSLSAPAPS EHADHEEQAD HAGHEDHGGH GDHADHGAAG
QDVDLDTVLA SVHAAGIDGP MEISVPVEEG APFTVQGTGR SWPVHQDAAA VDATSGEVVE
ELRFEDHPFV AKMATWGIAF HMGLLFGLPN QLLLTGIALS ALFLVFWGYR MWWLRRPTRD
TAFAMGRPLA PRGTWRGLPW WCLALVAAAA VGVGLFLPVF GVSLLAFLVV DAALGLRRGR
TRPESVPKP
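
The gene and protein sequences are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the tool that produced this record, of how the blocks might be joined, the GC fraction computed, and the DNA translated. It uses the codon assignments that translation table 11 shares with the standard code; the table's alternative start codons are ignored, which is harmless here because this gene begins with ATG. All function names are hypothetical.

```python
# Codon assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First row of the gene sequence shown above (60 bp).
first_row = join_blocks(
    "ATGACGTCCC ATGCCCCCGA CCGGCTCGAC TCCCCCGAGA CGCCTCCTCC GGACCGCCGC"
)
print(translate(first_row))    # MTSHAPDRLDSPETPPPDRR
print(gc_fraction(first_row))  # 0.75
```

The translated residues match the first two blocks of the protein sequence. Passing the full gene sequence through the same functions would allow the complete protein and the reported GC content to be checked in the same way.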