Gene Ndas_3766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3766 
Symbol 
ID9247635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4524169 
End bp4525407 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content71% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003681670 
Protein GI297562696 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGAACG AAACGCTAGA ACGCCTGCGC CTTCCGGGGG CCTGGGTCCT GCTTGCAGCC 
GCGGGCGTGC TCATGCTCTC CGGACTCATC CAGGTGATCG TCCGATCCAG CGGCTGGCGC
TCGGAGACGT CCTTCGCCTC GGCCATGTAC GACGCGGGCT CCGCCAACTT CTTCGGTGTG
ACCATCACGA TCCTGGTCGC CGTCGCCGTG GCCCTCGTGC TGACCAGCGA AAGGACCAGG
GCGGGCGCCT CCCCGGTCGT GCTGACCGCG ATGATCATCA CCGGTGTGGG CCTGCTGTTC
TGCCTGATCA CGGTGATCTG CGGCTTCATC GACGCGATCA ACATCGGCCA CGGCTTCGCC
CGGATGCTGG GGACCGTCGC CCAGGGCGCG GTGCTCGGCA TCTTCGGCTT CGCCGCGCTC
AAGGCCTTCA ACGACCCCAC CCTGGTCCCC AGGGTGGTGC GGCCGCAGAA CGCCTACCCG
CAGCAGTTCC CGCCCGCCAC CGGCGCGCAG CAGTCCTTCG CCCAGCCGGC CTACCCGGGC
CAGCCCGCCG ACCCCGCCCA GCAGTACGGC GTGGACCCCG CGCAGCAGGC CTACGGCCAG
CAGTACGGGA CCGATCCCGT CCAGCAGCAG TACGGGACCG GCGCCCAGCA GGCCTACGGC
CAGCAGTACG GCACCGACGC CCCGGCCCAG CAGTACGGGA CCGACCCCGT GCAGCAGCAG
TACGGCACGG GCGCCCAGCA GGCCTACGGC CAGCAGTACG GCACCGACGC CTCCGGGCAG
CAGCCCGTCC AGCAGGGCTA CGACGCCTCC CAGCAGTACG GCCAGCAGTA CGGGGCGGAC
CCCGCGCAGC AGGCCTACGG CCAGCAGTAC GGCACCGACG CCCCGGGCCA GCAGGCCGCC
TACGGCCAGC CGGGCGAGTA CGCCTACGAC CCGTCCCAGT ACGCCCCGCA GAGCACCGAG
CAGCAGCCCG CCTCCGGCGA GCAGGCCGCC CAGGACGCGA TCCAGTACGG CTGGTACCAG
GGCGCTGACC AGGGGCAGCA GGCGCAGGAC ACCCCCGCCG ACAGCAATCT TGATCCTTTC
TTTAACTCCG GTGAGAACAA CGGCAACCAG ACGCCCGGCC AGGGCGGCGG ATCGTACGGA
GGGCAGTACG GAGCGGGCAC CGGATACGGC TCTGACCAGC AGGGACAGGG CGGAACCGGT
GACCAGCAGG GGTGGTACGG CGGCGAGGAC AAGCGCTGA
 
Protein sequence
MKNETLERLR LPGAWVLLAA AGVLMLSGLI QVIVRSSGWR SETSFASAMY DAGSANFFGV 
TITILVAVAV ALVLTSERTR AGASPVVLTA MIITGVGLLF CLITVICGFI DAINIGHGFA
RMLGTVAQGA VLGIFGFAAL KAFNDPTLVP RVVRPQNAYP QQFPPATGAQ QSFAQPAYPG
QPADPAQQYG VDPAQQAYGQ QYGTDPVQQQ YGTGAQQAYG QQYGTDAPAQ QYGTDPVQQQ
YGTGAQQAYG QQYGTDASGQ QPVQQGYDAS QQYGQQYGAD PAQQAYGQQY GTDAPGQQAA
YGQPGEYAYD PSQYAPQSTE QQPASGEQAA QDAIQYGWYQ GADQGQQAQD TPADSNLDPF
FNSGENNGNQ TPGQGGGSYG GQYGAGTGYG SDQQGQGGTG DQQGWYGGED KR