Gene Ndas_3905 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3905 
Symbol 
ID9247776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4676854 
End bp4678215 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content76% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003681808 
Protein GI297562834 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGAAG CAGTAGCGTC CGCCCTCGCC GAGCTGGCAC GCCGGGACGA GCGGGCGGCC 
GACATGGCCC GGTCGGCCCT GGACACGCTG GCTCCGGAGC AGGAGATCGA GAGGCTCAAC
CAGCACGCCG TCCAGCGGTT CTGCTGGTTC GAGCTACCGG TCCGCTTCCA CGACCGCGCC
GAGGACCGGC TGTTCGCGGT CCGCGCCCTG GGCAACCTCT TCGCCCTCCT CCAGCTGCAC
CGGTACGCGC AGATCTGCTC GGCGCCGGAG ACCGCGGCCC TGCTCACCGT CTACGAAGGC
GACCACGCCG CGGGCCTGGC GATGTACGAG CGGCTCATGT GGGAGTCGGG CGTGGAGCCG
CCGGACCTGC CCGAGCTGAC CTGGGGGACG GCGGTCGGCG ACGCCGAGAT CCTGGCCCGC
GACGAGACCG CGTCGGCGCT GGAGCTGGCG ATCACCCTGG GCGACGTGCG CCCCGGCAAG
CGCGGATGGC GGCCCGCCCA GGAGCGGTTC GCCCGGTCGT TCCTCACCCA GCCGGACAAC
CGGGGGCTGA GCCACCTGGA CCGGATCCGC GAGGAGCGGG TGCGGGCCTG GTTGAGCTCG
TCCGCGCACC CGCACCGCCA GCGCCTGTGG CCGCTGGTGG GCCAGATCAT CGCGGGCGCC
GACGTGCCGC GCGGGGCCGA GGCCGCGATG GCGCCGCTCC AGCGGCTGCT GGACCTGGCC
GCGGAGGGGA TCGCGCTCAC CCAGATCGGC TACATCTCGC CGGGCGTGGT GCGCCAGATG
TGCGAGGACT TCGGGTGGCG GACCACCCCC GAGCCGCCGC GCAGCGAGAC CGACGCCACC
CAGCTCATCG CCCTGCACCA GGCGCTGCGG GGCATGCGCG CGGTGCGCCG GTCGGGCCGC
CGCCTGGTCC TGACGCGGCG CGGGCGCCAG CTGCGGGAGG ACCCCGAGGC GCTGTGGCAG
GCGGCCACCG AGACCCTGTG CCGCACCGGG GGCCTGGACC AGGCCGCGGC GGAGACGCTG
CTGGGCATGC TGCTGGCGCG CACCCCGCAG GGCAGCGGTT CGCACGCGCG TCGCGGCGAG
TCGGACGTGG AGGCGGCCGA GAGGGCGCTG ACCGAGTCGG GGTGGGTCCC GGCGGAGCCC
CCGGCTCCCT CGGGCAGGCA CGCCGCGTCG CACCGGCGCA CCGCGGAGGC CGCCTCGGAC
CAGGTGCGCG CGCTGGTGAT GGCGGTGAGC TGGCTGCTGG AGACCCTGGG CCTGCTCACC
GACGACGACG GAGCCGGACG CCGGGAGCTG ACCGCGCCGG GGCGGGCGTT CGCGATCGCC
TGCCTGCACC AGTCGGCGGT GGCGCCCCGG GCCGTGGTCT GA
 
Protein sequence
MREAVASALA ELARRDERAA DMARSALDTL APEQEIERLN QHAVQRFCWF ELPVRFHDRA 
EDRLFAVRAL GNLFALLQLH RYAQICSAPE TAALLTVYEG DHAAGLAMYE RLMWESGVEP
PDLPELTWGT AVGDAEILAR DETASALELA ITLGDVRPGK RGWRPAQERF ARSFLTQPDN
RGLSHLDRIR EERVRAWLSS SAHPHRQRLW PLVGQIIAGA DVPRGAEAAM APLQRLLDLA
AEGIALTQIG YISPGVVRQM CEDFGWRTTP EPPRSETDAT QLIALHQALR GMRAVRRSGR
RLVLTRRGRQ LREDPEALWQ AATETLCRTG GLDQAAAETL LGMLLARTPQ GSGSHARRGE
SDVEAAERAL TESGWVPAEP PAPSGRHAAS HRRTAEAASD QVRALVMAVS WLLETLGLLT
DDDGAGRREL TAPGRAFAIA CLHQSAVAPR AVV