Gene Ndas_0602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0602 
Symbol 
ID9244444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp738436 
End bp739845 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content71% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003678555 
Protein GI297559581 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.437149 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAAC CCGAACACGT CGGCGGACTC CTAGGCCCCG CCCTGGCCCG CATCACCGCC 
CTGGACGAGA ACCACCTCGA TCGGTGGATC GCCCGCGCCC GTGCCCTGCG CGGCTGCGCC
TGCCCGGTGC GCCTGACCGG CGAGACGACC CGTGTGGACG CCTCCACCGG GGAACTGCTC
TCGCGCTACT CCACCGCCAC CGAACCCGGC AACGAACTGC TCATGCGGTG CAAGAACCGT
CGGGCCTCGC GCTGCCCCTC CTGCTCGGAG GAGTACCGGG CCGACACCTA CCACCTGGTC
AAGGCCGGAA TCGTCGGCGG AGACAAGGGC GTGCCCACCT CGGTCGGCGT GCACCCCCGG
GCCTTCCTCA CCCTGACCGC TCCCTCCTTC GGCGCGGTCC ACCGTGGCCC CGGCAAGGAC
GGCCGCACCC GGGTGTGCCA CCCTCGCCGG ACCGGCGCGG CCTGCTTCAC CCACCACCGC
GCCGACGACC CCCGCATCGG TCAGCCACTC GACCCGGCCG CCTACGACTA CATCGGTCAC
GTGCTCTGGC ACGCCCACAC CGGGGAGCTG TGGCGGCGCT TCACCCTGTA CCTGCGCAAC
CACCTGGCCT CGGCCGCCGG TCTCTCGCGC ACGGACTTCT CCAAGCGGGT GCGCATCTCC
TACGCCAAGG TCGCAGAGTT CCAGTCCCGG GGCGCGGTGC ACTTTCACGC CGTGATCCGG
CTGGACGGCT ACACCAAGGA CCCCACCGGC TGGCCGCCGC CTCCGGTGTG GGCCAGCATG
GACATGCTCA CCGCGGCCGT GGACTCGGCC GCCCGCACGG TCTCGCTCAC CTCCCCTGAG
ATTAACGGCC GTACCTGGAC TCTGGGGTGG GGTGAGCAGG TGGACGTGCG CCCCATCGAG
GACTTCGGCC CCGACCGGGC GTTGACGGAC ACGGCCGTGG CCGGGTACAT CGCCAAGTAC
GCCACCAAGG CCGCTGAGGA CACCGGCACC CTGGACCGGC GTATCCACGA CATCGACCAC
GTGGACATGA CGCAGGTGCG CCCGCACGCG GGCAAGCTCA TCTACACCTG CTGGCGTTTG
GGCAACACGC GCCTGTACCC CCAGCTCGAA GACCTCAAGC TGCGCCAGTG GGCGCACATG
CTCGGGTTCC GCGGTCACTT CTCCACCAAG TCGCGCCGCT ACTCCACCAC CCTGGGTGCG
CTGCGTCAGG TGCGGGCCGA CTACGCCGCC GGACGTCCCT GGGACACCGA GACCTTTACC
CCGCTCGTGG TCCAGGGCGA AGAGGGTTCG ACGCTGAGCC TGGGCAACTG GCACTACCTC
GGGCAGGGCC TCACCCCGGG AGAGTGGGCG CTGGCGTCCT TGGTCGCCGG GATGGGCCGC
ACCACCGAAG ACGGGGAGGT GGACAGGTGA
 
Protein sequence
MPEPEHVGGL LGPALARITA LDENHLDRWI ARARALRGCA CPVRLTGETT RVDASTGELL 
SRYSTATEPG NELLMRCKNR RASRCPSCSE EYRADTYHLV KAGIVGGDKG VPTSVGVHPR
AFLTLTAPSF GAVHRGPGKD GRTRVCHPRR TGAACFTHHR ADDPRIGQPL DPAAYDYIGH
VLWHAHTGEL WRRFTLYLRN HLASAAGLSR TDFSKRVRIS YAKVAEFQSR GAVHFHAVIR
LDGYTKDPTG WPPPPVWASM DMLTAAVDSA ARTVSLTSPE INGRTWTLGW GEQVDVRPIE
DFGPDRALTD TAVAGYIAKY ATKAAEDTGT LDRRIHDIDH VDMTQVRPHA GKLIYTCWRL
GNTRLYPQLE DLKLRQWAHM LGFRGHFSTK SRRYSTTLGA LRQVRADYAA GRPWDTETFT
PLVVQGEEGS TLSLGNWHYL GQGLTPGEWA LASLVAGMGR TTEDGEVDR