Gene Ndas_4401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4401 
Symbol 
ID9248276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5234775 
End bp5236376 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content76% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003682296 
Protein GI297563322 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0879683 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACGCCCC CCACCCCCGA CCGGGCCCCG CACGGGACCG GGACCAGGCC GCCGGAGGCC 
CCGCACGGCG CGCTCCCGGC GGTGCCGGGC GGCCAGCCCC CGCGGGCCCC GGCGCCCGAC
CGCGAGGGCG CCCGCCCGCC CCGACGCGGG CGGAGCGGGC ACGGCGCCCG TGAACGGGTC
TCCGGGCTCC GCCGCTGGGC CGGCACCACC CCGGCTCGGG TGTGGCTGCT GTCCTCGGTG
TGCGTGCTGG CCGTCCTCGG ACTGCTCGGC TCGGCGACCG CCACCCTCGG CCAGGCCCGC
GACGGCCTGC GCGTGCTCGG GGACGACGCC GGTCCCCAGG CACTGGCCAC CACCGACCTG
TACCTGGCGC TCGCGGACAT GGACGCCCGC GTGGCCGACG TGCTGCTCAT GGGCACCGAC
CACGGCCTGG GCCCGGGCCG CGAGGACGCC CTGGAAGGGT ACGAGGCGAG CCGGGCACGG
GCCAACGAGG CCCTGCTCGA AGCGGCATCG CTGACCGAGG GCGACAGCGT GGAGGAGCGC
AACGTCGAGG CGGTCCTGAA CGGGATGGGC GCCTACGAGC AGCTCGCCAG AGAGGCGCGG
CTGCTCAACG ACGAGGCACA GGCCCCGCCC GGCCGGGTGG ACGGGGCGGC CCTGGAGGCC
TACGGCGAGG CCACGCGGCT GATGCACGCC GAACTGCTGC CCAAGGCGTT CAACCTGGGG
CTGGACGCGT CGGCGAACGT GCGCGCCAAC CACGAGGAGG GGCAGTCCTC CGTCGCGCTC
GGCATGCTCT GGGTGGGCGC GGTCGGCACG GTCACCGTCG CCGCGCTGCT CGGCCTCCAG
CTGTACCTGC GCGTGCGGTT CCGGCGCCGG TTCAGCGCGC CGCTGCTGGC CGCCACGGCG
GTCGCAGTCC TGCTCACCGG CGGGGTGGTC CTCGTCCTGA CCGTCAGCGG CGGGCACCAA
CGCGACGCCA AGGAGGAGGG GCTGGACGCG GCCATGGCGC TGTCCCGGGC CGGGGCCATC
TCCACCGACA TGCAGGCGGA CCAGAGCCGC TACCTGCTCG ACCGGGACAT GGCCGACAAC
TACGAACTGG TGTACCTGGA GCGGGCGCAG CAGGTGCTGT ACCGCCCGGC CAGCAACCTG
GACGCGTACT ACGGTCAGAT CGAGGGCGTG GTCGCCGCCT ACCCGGAGCT GCCGGGATCG
GACGGCTCCG ACCCCGAGGA CCCGGGCACG CTCGGCTACC TGGGCCAGCG GGCCCAGGAC
ACGCTGCTGC CGGGGCAGGA GGGCGCGCTC GCCGAGGTGC TGGAGTCCTA CAACGCCCTC
CAGGCGGAGG ACCGGGCGCT GCGTTCGGCC GCCGAGGAGG GGGACCTGGC GGGGGCCGTC
GGGGTCCGCA TGGGCGTCGC GCACTCCGAG GACGGCGCCT TCCAGACCTA CGAGGCCGCG
CTGGCGGAGC TGACCGACCT GCACGAGGAG GCGTTCGACG CGGGCATCGG ACGCGCTGAC
GCGGCGCTCA CCCCCTGGAC GTGGCTGCTG CCCGCCGCGA CGGCCCTGCT GCTGGTCCTG
CTCGTCCTCG GGGTCCGCCC CCGTCTGGCC GAGTACCGCT GA
 
Protein sequence
MTPPTPDRAP HGTGTRPPEA PHGALPAVPG GQPPRAPAPD REGARPPRRG RSGHGARERV 
SGLRRWAGTT PARVWLLSSV CVLAVLGLLG SATATLGQAR DGLRVLGDDA GPQALATTDL
YLALADMDAR VADVLLMGTD HGLGPGREDA LEGYEASRAR ANEALLEAAS LTEGDSVEER
NVEAVLNGMG AYEQLAREAR LLNDEAQAPP GRVDGAALEA YGEATRLMHA ELLPKAFNLG
LDASANVRAN HEEGQSSVAL GMLWVGAVGT VTVAALLGLQ LYLRVRFRRR FSAPLLAATA
VAVLLTGGVV LVLTVSGGHQ RDAKEEGLDA AMALSRAGAI STDMQADQSR YLLDRDMADN
YELVYLERAQ QVLYRPASNL DAYYGQIEGV VAAYPELPGS DGSDPEDPGT LGYLGQRAQD
TLLPGQEGAL AEVLESYNAL QAEDRALRSA AEEGDLAGAV GVRMGVAHSE DGAFQTYEAA
LAELTDLHEE AFDAGIGRAD AALTPWTWLL PAATALLLVL LVLGVRPRLA EYR