Gene Ndas_3874 details

Gene Information

Locus tag: Ndas_3874
Symbol:
ID: 9247745
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4644821
End bp: 4645873
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003681777
Protein GI: 297562803
COG category:
COG ID:
TIGRFAM ID:
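
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts one terminal stop codon; the variable names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 4644821
end_bp = 4645873
gene_length_bp = 1053
protein_length_aa = 350

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one stop codon, which is
# not counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("codons match protein length:",
      remainder == 0 and coding_codons == protein_length_aa)
```

Under these assumptions both checks agree with the record: the span is 1053 bp, and 351 codons minus the stop codon give 350 amino acids.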


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGACT TGCGCGGCAA CCAGGCCGTG TGGCGGTTCG ACGGTGAGAC GGTCGCGATC 
AGGTACGAGG CCCAGGGCTG GTTCAAGGAC CCCCTGCTCA AACGGATCGG CCAGCTCGAA
CTCCCCGTGG CCGCGATCGC CGAGGTCGAC TTCCAGCCCG GCGCGGGCCC CAGGAAGGGC
TGGCTGCTGC AGCTGCGCCT GCACGAGCGG ACCGACCCCT ACGCGGCGGT CGGCGCGATG
CTCAAGGAGA AGTCCCAGCC CTTCCGGCTC ACCGGCCAGG CCAGCGGCGA ACTGGTCGCC
GAGTACCTGG CCGACCAGAT CCGGTTCGCC GCCGAGCAGA GCGGGCCGCC CGCCCCCGAC
ACGGCCGTCC GCCTGGTGCC CCGGCTGCCC TTCCACATCC AGACCTCCGA GGGGACCGCG
ACCCTGGACG GCTCCACCGT GCGCCTGGTC TGGTCCGGCG GTGAGGCGAG CGGGCGCAAG
CGCAGGGCGC AGCGCCGCGA GTACGACCTC TCCGAGATCA CCGGGGTGGA CTGGGCGCCC
TCCGACGGCT GGGAGTGGGG CTACATGCGC CTGGTCACCG CCGACACCGG CGGCAGGGAC
ACCGGCAAAC CCAAGCAGGA CCTGCACGCC CTCGTCGCCG AGGAGGGGGC GGAGGGCTAC
GACACCCTGC TCATGGCCGC GGCGGTCACC GCCCACGTGT GGGCCGCGGA GGCGTCGGGG
GCCGGCGGCC GGGAGGGGCG CGGCGTCGCC GCCAGGCTCA GGGACCCGCG GTGGTGGCTG
GACGCGGCGG CGCGCTCGAC CGACCAGCTG CGGGCCCTGT CCGCGGGGTC CGCGGCTCCG
GACGCGGGGG AGGGCGCCGG ACCCGGAGCC GGACCGGGGG CCGGGGCGGC CTCCCCGCAG
CAGGCCCTGG ACGCCGCGGG GAAGGCGGAC GGCCGACCCG ACAACGAGTG GATCTTCCAG
CAGATCGAGC GCCTGGGAGA ACTGCACGCC AGGGGACTGC TCACCGACGA GGAGTTCTCC
GCCAAGAAGG CCGAGCTGCT CGGCCGGATC TGA
 
Protein sequence
MDDLRGNQAV WRFDGETVAI RYEAQGWFKD PLLKRIGQLE LPVAAIAEVD FQPGAGPRKG 
WLLQLRLHER TDPYAAVGAM LKEKSQPFRL TGQASGELVA EYLADQIRFA AEQSGPPAPD
TAVRLVPRLP FHIQTSEGTA TLDGSTVRLV WSGGEASGRK RRAQRREYDL SEITGVDWAP
SDGWEWGYMR LVTADTGGRD TGKPKQDLHA LVAEEGAEGY DTLLMAAAVT AHVWAAEASG
AGGREGRGVA ARLRDPRWWL DAAARSTDQL RALSAGSAAP DAGEGAGPGA GPGAGAASPQ
QALDAAGKAD GRPDNEWIFQ QIERLGELHA RGLLTDEEFS AKKAELLGRI
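
Both sequences are displayed in space-separated blocks of 10 characters, so the whitespace has to be removed before they can be measured or analyzed. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Sample input: first and last lines of the gene sequence shown above.
# Paste the full block here to analyze the whole gene.
sample = """
ATGGACGACT TGCGCGGCAA CCAGGCCGTG TGGCGGTTCG ACGGTGAGAC GGTCGCGATC
GCCAAGAAGG CCGAGCTGCT CGGCCGGATC TGA
"""

seq = clean_sequence(sample)
print(len(seq), "bases")
print("starts with ATG:", seq.startswith("ATG"))
print("ends with stop codon TGA:", seq.endswith("TGA"))
print("GC fraction of sample: %.2f" % gc_fraction(seq))
```

Applied to the full gene sequence, the cleaned length should equal the listed gene length, and the GC fraction can be compared against the listed GC content.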