Gene Ndas_2962 details


Gene Information

Locus tag: Ndas_2962
Symbol:
ID: 9246815
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3537264
End bp: 3538424
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: FeS assembly protein SufD
Protein accession: YP_003680878
Protein GI: 297561904
COG category:
COG ID:
TIGRFAM ID:
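
The coordinates and lengths listed above are mutually consistent if the start and end positions are read as 1-based and inclusive (an assumption; the record does not state its coordinate convention). A minimal Python sketch of that arithmetic, using hypothetical variable names:

```python
# Values copied from the record above.
start_bp = 3537264
end_bp = 3538424

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is the stop codon
# and does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1161 bp
print(protein_length, "aa")  # 386 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.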


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCAGCC CGAGCACCGA ACCCAAGGCC CACAGCCACG GCCTGGGGGA GATCCCGTTC 
TCCAAGTTGG CCACCCACGG GTCCTTCGAC GTGAACGACT TCCCCGTCCC CGGCGGCCGT
GAGGAGGAGT GGCGCTTCAC TCCGATCCGC CGCCTCAAGG GCCTGCACGA CGGCAGCGCC
GTCGCCGACG GCACCGACAA GCTCGACGTG GACGTCCCCG AGGGCGCCAC CGTCGAGACG
GTCGGCCGCG GCGACGCGCG GCTCGGCACC GCCGGGTTCC CCGCCGACCG CGTCACCGCC
CAGGCCTACA CCTCCTTCGA GGAGGCCACG GTCATCACGG TCGCGCAGGG CGCGGCCCTG
GAGCGGCCGA TCACCGTCAA CCGCCAGGCC CAGGGCGGCA CGCGCTACGA CCAGTTCGTC
CTGGACGTCA AGCCGCTGGC CGAGGCGGTC GTGGTCCTCA ACCAGACCGG CACGGGCGTG
CGCGCGGGCA GCGTGGACAT CCACGTCGGC GAGGGCGCGC GCCTGACGCT GATCAGCGTC
CAGGACTGGG ACGGCGACGC CGTCGACGTC TCGCAGCACA ACGCCCAGGT GGCCAAGGAC
GCCACGTTCA AGTCGATCGT GATCACCCTG GGCGGCGACC TGGTCCGGCT CTCCCCGAAG
GTGGCCTACA AGGGGCGCGG CGCCAACGCC GAGCTCCACG GGCTGTACTT CACCGGCGAC
GGCCAGCACC ACGAGCACCG CTCGCTCATC GACCACAACA TGTCCAACAC CCGCTCGCGG
GTGGAGTACA AGGGCGCGCT GAGCGGCAAG GACGCCCACG GTGTGTGGAT CGGCGACGTG
ATCATCGGTG AGGGCACCAC CGGCACCGAC TCCTACGAGC ACAACCGCAA CCTCCAGCTC
ACCGACGACA CCCGTGTGGA CTCGGTGCCC AACCTGGAGA TCTTCACCGG TGAGGTGGAG
GGCGCCGGGC ACGCCGCCGC CAGCGGTCGC CTCGACGACA TCCACCTGTT CTACCTGCGC
TCCCGGGGCA TCCCGGAGGA CGAGGCCCGG CGCCTGGTCA TCCGCGGCTA CTTCCTGGAG
CTCATCAACC GGATCCCGGT CGAGGAGCTG CGTGAGGAGA TCATGGCCAA GGTCGAGCGC
AAGCTGGCCG CCCATGAGTG A
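
The GC content field of the record can be rechecked from the gene sequence itself. The following is a small sketch, not the database's own method: `gc_fraction` is a hypothetical helper that ignores the whitespace used to group the sequence into blocks of ten and returns the fraction of G and C bases.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G/C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    if not seq:
        return 0.0  # avoid dividing by zero on empty input
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage: paste the full gene sequence (all lines) into one string, then:
# print(f"{gc_fraction(gene_sequence):.0%}")
```

Passing the complete gene sequence, with its spaces and line breaks left in place, gives a value that can be compared against the reported GC content.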
 
Protein sequence
MTSPSTEPKA HSHGLGEIPF SKLATHGSFD VNDFPVPGGR EEEWRFTPIR RLKGLHDGSA 
VADGTDKLDV DVPEGATVET VGRGDARLGT AGFPADRVTA QAYTSFEEAT VITVAQGAAL
ERPITVNRQA QGGTRYDQFV LDVKPLAEAV VVLNQTGTGV RAGSVDIHVG EGARLTLISV
QDWDGDAVDV SQHNAQVAKD ATFKSIVITL GGDLVRLSPK VAYKGRGANA ELHGLYFTGD
GQHHEHRSLI DHNMSNTRSR VEYKGALSGK DAHGVWIGDV IIGEGTTGTD SYEHNRNLQL
TDDTRVDSVP NLEIFTGEVE GAGHAAASGR LDDIHLFYLR SRGIPEDEAR RLVIRGYFLE
LINRIPVEEL REEIMAKVER KLAAHE
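
The gene begins with TTG while the protein begins with M. This fits translation table 11, in which TTG is one of several alternative start codons and an initiating codon is translated as methionine. The sketch below illustrates this under stated assumptions: `translate` is a hypothetical helper, the codon table is built from the TCAG-ordered amino acid string that table 11 shares with the standard code, and the start-codon set follows NCBI's definition of table 11.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an alternative start codon as M.

    Whitespace is ignored, translation stops at the first stop codon,
    and trailing bases that do not fill a codon are dropped. Ambiguous
    bases (such as N) are not handled and raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon always yields methionine
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "TTGACCAGCC CGAGCACCGA ACCCAAGGCC CACAGCCACG GCCTGGGGGA GATCCCGTTC"
print(translate(first_line))  # MTSPSTEPKAHSHGLGEIPF
```

The output matches the first 20 residues of the protein sequence listed above.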