Gene Ndas_4027 details

Gene Information

Locus tag: Ndas_4027
Symbol:
ID: 9247899
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4818573
End bp: 4819859
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 78%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003681930
Protein GI: 297562956
COG category:
COG ID:
TIGRFAM ID:
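
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon (both assumptions agree with the values listed here, but the record does not state them). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4818573, 4819859)
print(length)                     # 1287
print(protein_length_aa(length))  # 428
```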


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCTGC CCGCCTTCGA GGCCCTGCTC ACCGACGCGG GCCGGGAGGT CCTGGCGGGG 
GTGGACCCGA AGGCCGCCGC CGGGGACCAG CTCGCCGCGG CCTCACTGCT GCGCGGTGAT
CCGCGCGTGG CGGCCCTCGA CCCCGGTCTA CCCGTATCCG AGCTGGTAAA CGCCGTCCTG
ACCCAGGTCT CCCTGCGCGA ACGCGGGCGG GCCAAGTTCG GCGAACTCGC GCGGCGGATG
TTCTTCACGC CCAACGGGCT GGAGCAGTCC ACCCGGCGCG TGGTGGCCGA CTACCGCGCG
GAGCGCATGG CCGGGGCGAT CGGCTCGGCC GGGTCGTCGG GGCGGGGCCT GGACGGCGCC
GCCGCGGGCG ACCTGTGCTG CGGGGTCGGC GCGGACCTGC TGGCCCTGGC CGCGCGCGGC
GTTCCGGTCG AGGGCGTGGA CGCCGACCCC CTCACGGTCG CGGTGGCCCG CGCCAACATC
GGGGCCCTGG GCCTGTCCGG GCTGGCCCGG GTCCGCGAGG GCGACGCCTC CGCGACCGCG
CCCGGCCGGT ACCCGCTGCT GTTCTGCGAC CCCGCCCGGC GGGGCGGGCG CGGCCGGGTG
TTCGACCCCT CGGCCTACTC CCCGCCGTGG GACACGGCCG TGGACCTGGC CTCGGGGGCG
CAGGCGGCCT GCCTCAAGGC CGCGCCGGGC GTCCCGCACG AGGCGCTGCC CGGGGACGCG
GCGGCCGAGT GGATCTCGGT GGACGGGGAG CTGAAGGAGA CCGCCCTGTG GTTCGGCGCG
CTGGCCGACG GGCCGCGGCG CCGGGCGACG GTGCTGCACG AGCGCGAGGG GCTGCTCGCC
CGCGAGGGCG CGGTGGCGCA CCTGGACGCC GACCCCGGTC TGGGCCCGGC CCCGGTGGCG
TCGGCCCGGC GCTACCTGTA CGACCCGGAC CCGGCCGTGG TGCGCTCGCA CCTGGTGGCG
GAGGCGGCGG CGCGGGTGGA CGGCGCCCTG CTGGACGAGC GGATCGCGTA CTTCACCGCG
GACAGCGCTG TGCTCTCGCC GCTGTGGCGG GTGCTGGAGG TGGTGGAAGT GCTGCCGTTC
TCGCTGAAGC GGCTGCGCTC GGCGGTGCGC GGCCTCGGGG CGGGGACGGT GACCGTCAAG
AAGCGGGGTT CGGCGGTGGA CACCGAGAAG CTGCGCCGGG ACCTGCGCGC CTCCGGCCCG
GAGTCGGTGA CGGTCGTGCT CACGCGGATC GGCGAGCGGC CCTTCTGCCT GCTGTGCCGC
GAGGTGGGCG ACCCGCGGCG GGGTTAG
 
Protein sequence
MRLPAFEALL TDAGREVLAG VDPKAAAGDQ LAAASLLRGD PRVAALDPGL PVSELVNAVL 
TQVSLRERGR AKFGELARRM FFTPNGLEQS TRRVVADYRA ERMAGAIGSA GSSGRGLDGA
AAGDLCCGVG ADLLALAARG VPVEGVDADP LTVAVARANI GALGLSGLAR VREGDASATA
PGRYPLLFCD PARRGGRGRV FDPSAYSPPW DTAVDLASGA QAACLKAAPG VPHEALPGDA
AAEWISVDGE LKETALWFGA LADGPRRRAT VLHEREGLLA REGAVAHLDA DPGLGPAPVA
SARRYLYDPD PAVVRSHLVA EAAARVDGAL LDERIAYFTA DSAVLSPLWR VLEVVEVLPF
SLKRLRSAVR GLGAGTVTVK KRGSAVDTEK LRRDLRASGP ESVTVVLTRI GERPFCLLCR
EVGDPRRG
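
The protein sequence can be re-derived from the gene sequence. The sketch below is one way to do that in plain Python, not the method the database itself uses. It assumes the sequence is pasted with the 10-base spacing shown above, and it uses the standard codon assignments, which translation table 11 shares with the standard code for internal codons (table 11 differs in which start codons it allows; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Standard codon assignments in TCAG order; table 11 uses the same
# amino-acid assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, dropping one trailing stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence listed above.
first_line = "ATGCGCCTGC CCGCCTTCGA GGCCCTGCTC ACCGACGCGG GCCGGGAGGT CCTGGCGGGG"
last_line = "GAGGTGGGCG ACCCGCGGCG GGGTTAG"
print(translate(first_line))  # MRLPAFEALLTDAGREVLAG
print(translate(last_line))   # EVGDPRRG
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared against the protein sequence, Protein Length, and GC content fields of this record.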