Gene Ndas_1810 details

Gene Information

Locus tag: Ndas_1810
Symbol:
ID: 9245660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2215826
End bp: 2216869
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: Abortive infection protein
Protein accession: YP_003679744
Protein GI: 297560770
COG category:
COG ID:
TIGRFAM ID:
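
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(2215826, 2216869)
print(length, protein_length_aa(length))  # 1044 347
```

Both values match the Gene Length and Protein Length fields of this record.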


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0409959
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACAGCA ACGACGGCCG CGCTCCCCGG GCTCCCCTTC CGCAGACGGA GGCCCCCCTC 
CCCGCACCCT CGGGTGCGGG GAGGGCGCAG GCGCCCGCGC GCACACCCTC CGGCCCGGCC
GCGCCGCACC CGCGCCGCCG CGGCCCGTGT CCCCTCCCGC CGGGCGTGGA GTACCACCGG
GTACTCGCCG GGGACAGGCG CCGGATCGGC CGGGGAGTCC TCGCGATCGC GCTCCTGGTG
GCCGGGCTGT TCGCCTCCAA CATCGTTCTG GCCGTGGCCG CCGCCTTCGC CGACAGCCTG
ATGGGCAGGA CCAACCCGAC CTTCGGCGGC ACCGACTACA CACCGCTCTT CCTCGCCGCG
AACCTGGTGT CGATCGCCCT GCTCATCCCG TGGAGCATGC TGCTCCAACG GTGGCTCTAC
GGGGTGCGCG GCCCTTCGCT GCACTCGGTG CTGTCGAGTC TTCGCTTCGA CCTGCTCGGC
CGGGCGCTCC TGTTCATCGG CCCCCTCTGG CTGTTCCTCT CGTTCATCGG CCCCGCCGTC
ATGCCCCTGC GCGAGGTCCA CTGGTCCACC GCCACACTCC TCACGGTCCT CGCCGTCAAC
CTCCTGTTGA CGCCCCTCCA GGCGGCGGGT GAGGAGTACG GTTTCCGCGG GCTCGTGTTC
CGCGTCGCCG GAAGCTGGGG GCGCGGCCCG CGTACGGCGC TGTTCCTCGG CGTCCTCGTC
TCCGGTCTGG CGTTCACCGC CGTCCACCTC TCGACCGATC CGTGGTTCAA CCTCTGGTGC
CTCACGCTCT CCGTCAGCCT GGCCGTCGTC ACCTGGCGCA CCGGCGGCAT CGAGATCGCG
GTCGTCGTCC ACGCTCTGCA CAACACACTC GCCGCCCTGG TCCTCACGGT TCTGCACGCC
GATCCGAACC CCGCGCTCGA CCGTTCGGCG GGTACCGGGT CCGCTGCCCT GGCCGTGCTC
TGCGTCGCCG TCGCGGCCGT CGCGGCGGTG GTGTGGTGGC GCACGCGCGG GACGGGACCG
GCCCTGACCC CCTCCGGCCC GTGA
 
Protein sequence
MHSNDGRAPR APLPQTEAPL PAPSGAGRAQ APARTPSGPA APHPRRRGPC PLPPGVEYHR 
VLAGDRRRIG RGVLAIALLV AGLFASNIVL AVAAAFADSL MGRTNPTFGG TDYTPLFLAA
NLVSIALLIP WSMLLQRWLY GVRGPSLHSV LSSLRFDLLG RALLFIGPLW LFLSFIGPAV
MPLREVHWST ATLLTVLAVN LLLTPLQAAG EEYGFRGLVF RVAGSWGRGP RTALFLGVLV
SGLAFTAVHL STDPWFNLWC LTLSVSLAVV TWRTGGIEIA VVVHALHNTL AALVLTVLHA
DPNPALDRSA GTGSAALAVL CVAVAAVAAV VWWRTRGTGP ALTPSGP
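
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the method used to produce this record, showing how the blocks could be cleaned, how a GC fraction could be computed, and how the nucleotide sequence could be translated. It assumes the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code (the two tables differ in their permitted start codons), and it handles only the unambiguous bases A, C, G and T. All names in the code are hypothetical.

```python
# Codon table in the conventional TCAG ordering (standard code assignments,
# assumed here to apply to translation table 11 as well).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCACAGCA ACGACGGCCG CGCTCCCCGG"
print(translate(first_blocks))  # MHSNDGRAPR
print(round(gc_fraction(first_blocks), 2))
```

The translated prefix matches the first block of the protein sequence. Applying `gc_fraction` to the full gene sequence would give a value to compare against the GC content field.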