Gene Ndas_3957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3957 
Symbol 
ID9247828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4731595 
End bp4732719 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content73% 
IMG OID 
Productribonuclease BN 
Protein accessionYP_003681860 
Protein GI297562886 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.707734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCGCAG CGCTGGACCG GGCCAGGGAC ACCGGCAGGC ACTACCGGAA CCTGGGCATG 
GACGCGTACT GGGAGCTGCG CAGGCGCAGG CCCGCCCTCG ACCACCTGGT CCGGGCCTAC
GAGCGCTACG CCGACCGCAA CGGCAACCAG CTGGCCGGAG CGGTCACCTA CTTCGCCTTC
CTGTCCTTCT TCCCCCTGCT GGCGCTGGCC TTCGCCGCGG TCGGCTACCT CGCCGCCGTG
CAGGTCGAGG TGGGCGACTA CCTCCAGCAG GCGCTGGACG GCGTCCTGCC GGGCCTGTCC
GAGCAGCTGC CCATCGACGA GATCGCCCAG GCCCGCGTGG GCGCCGGTGT CATCGGTGTG
CTCGGTCTGC TCTACGCGGG CCTGAACGCG GTGTCGGCGC TGCGCGAGGC CCTGCACTCC
ATCTGGCTCA AGAACCTCAG GGAGGGTCCC AACATCCTCC TGCGCAAGCT GGCCGACCTG
CTGGTCATGC TGGGCCTGGG CGCGGCGCTG CTGCTCGCGG TGGCCTTCAC CAGCGTCGCC
CAGACCGCCA CGCAGTGGCT CCTCGGGCTG GTGGGCCTGG ACGGTTCGCT CCTGGCCAAC
CTGTCCCTGC GCGCGCTGGC CCTGGTCATC GCGGTCGGCG CCAACATAGT GATCTTCGTG
CTGGCCTTCG CGCTGCTGTC GGGCAGCGGG CGGCCCACGC GGATGATGTG GAGGGGAGCC
CTGCTGGGGG CGGTCGGATT CGAGGTCCTC AAGGCCGCGG CGGCGGTGCT GCTGGCCGGG
ACGCTCGGCA ACCCCGTCTA CGCGTCCTTC GCCGTCCTGG TCGGGCTGCT GGTGTGGATC
AACCTCGTGA TGCGCCTGGT GATGTTCAGC GCGGCGTGGA CGGCCACGTG GCTGCCGATG
CCGCCGCCCT ACACCGGCTC CCTGCCCCTG CCGGAGGAGC ACGGCATGGA CTGGACCCGC
CCGGACTGGG TCTCGGGGGT GACGGCGCGG GTGGCGGCGC GGGAGGCGGC CGAGCGCCGC
CGGGCACGGC GCGCGCTGGC CGGGGTGTTG TCCCTGCTGG GAGCGGCCGG TGCCGCGGGC
ACCACGGCGT GGGCGCTGCG CCGACGCCGC GGCGATCGCG TGTAG
 
Protein sequence
MAAALDRARD TGRHYRNLGM DAYWELRRRR PALDHLVRAY ERYADRNGNQ LAGAVTYFAF 
LSFFPLLALA FAAVGYLAAV QVEVGDYLQQ ALDGVLPGLS EQLPIDEIAQ ARVGAGVIGV
LGLLYAGLNA VSALREALHS IWLKNLREGP NILLRKLADL LVMLGLGAAL LLAVAFTSVA
QTATQWLLGL VGLDGSLLAN LSLRALALVI AVGANIVIFV LAFALLSGSG RPTRMMWRGA
LLGAVGFEVL KAAAAVLLAG TLGNPVYASF AVLVGLLVWI NLVMRLVMFS AAWTATWLPM
PPPYTGSLPL PEEHGMDWTR PDWVSGVTAR VAAREAAERR RARRALAGVL SLLGAAGAAG
TTAWALRRRR GDRV