Gene Ndas_4251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4251 
Symbol 
ID9248125 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5067395 
End bp5069008 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content73% 
IMG OID 
ProductDomain of unknown function DUF1814 
Protein accessionYP_003682146 
Protein GI297563172 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.176604 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.224045 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTTCT CCGGTGACTT CGAGACCCAC CTGACCCTGG ACGCGACCGC CCCCGGGCGC 
GTCGCGGAGG CGTCGGAGTG GGCGCGCGAG CACGGGCTCA AGTTCACCCA CATCGAGCTG
GACCGGGGCG AGTCCCCGTC ACAGCCGATG GTCACCTACC ACTCCGACGG GAGCACCCTG
GCGCGGGAGC TGGCCGTGGC GGAGCGCTGG GCGGCCCTGC TGGCAGGGGC CGGGTTCGCG
GTCACCCGCA CCAAGCTGGA GGTGTCCCGC CGGGCCGCCG GGGTGCCCTG GGACCGGGAG
GAGGCCGAGC TGTTGCCGGA GTCGTGCTAC TTCGAGACGC ACGTCAAACT CCTGCTCCCC
GCGTCGGCCG ACCTGGCGGC GCTGTCCGCG ATCGTGGAAC CCCACCGCGC CCGGTTGTCG
CGCAACGCGC GCAGGGTCCG CGACGACGGC TTCCAGGAAC GGTTCGTCAC CCAGCGGTGC
TCACGTGTGG GCCACAGGGA GGCCGCCCGG TTCGAACACG CGCTGTTGAA GGCCCTGGAG
AGGGCCGGGG TGACCTTCGA GGACAAGGAG GGGTGGCAGC CCAGGGTCCT GTCCGTCGAG
CGGGAGTTCG TCGTCCACGA CACCGCCCTG TCCGTGGACG CGGGGTGGAT GGACGCCGCC
CCGGTCCGCG ACGCCGACGA GGTTCAGCCG AGCGTGTACG CGCCGGACGG CTACCGGCAG
CGCCCGCCCG GCACCTACGT GCCCAACACC GACGGCCCCG AGGCGAGTCA GGGCAAGGTG
TTCGACCCCG CGCTCAAGCA CCTGGACGAC GCCTACCGGG CGGGTGAGCC GGTGTTCACC
GACCCCGGCC TGGGCTCCCG CTGGTGGGAG GCCAACCGGC GGGCGATGGA GCTGGCCCTG
CGCGCGATCT CGGGTACACC GTGGCGGGAC GGCCTCATGC TGCGCGGCAG CATGCTCATG
CCGGTGTGGG TGGGTGACGC CGCCCGGCGC CCCCGAGACC TGGACTTCGT GGTGGTCCCG
GCCGAGACCG CCCCGTTCGG GGACCCGGCC GACCGCATGT TCGCGGACGT GGTCGGGGCC
GTCACGGACG CTTCCGCGCA GGGGATCTCC TTCGACGCCG AGGGCGTGCG GCTGGAGAGC
ATCTGGACCT ACGAGCGGGC CCCGGGTCGC CGCGTGGTCG TCCCGTGGCG GGCCGGGGGC
CTGCCCCCGG GCACCGTGCA GATCGACGTG GTGTTCAACG AGTCGCTGCC CGAGCCGCCG
GTGGCGGTGT CGGTGGCGGG GTCGGACGTG CTGGCGGCCG GGGCGGAGCT GTCCCTGGCC
TGGAAGGTGC TGTGGCTGTA CACGGACACC TACCCGCAGG GCAAGGACCT CTACGACGCG
GTCCTGCTGG CCGAGAGCGC GCGGCCCTCG CGTGAGCTGC TGGTCGGTGT GCTGCGCCCC
GAACTGGGCG ACCGGGCCGA GACCGTGGAC GAGCGCTTCC TGCGGGAGGA GGGCAGCCTC
GACTCCGACG AGTGGGACGA CTTCGTCAGC GACTGCCCGT GGGTGGAGGG CGACGCCGGG
GAGTGGGTGG ACCGCTTCGA GGCGGCGATG GCCCCCGTGT TCCGGGGGGA GTGA
 
Protein sequence
MEFSGDFETH LTLDATAPGR VAEASEWARE HGLKFTHIEL DRGESPSQPM VTYHSDGSTL 
ARELAVAERW AALLAGAGFA VTRTKLEVSR RAAGVPWDRE EAELLPESCY FETHVKLLLP
ASADLAALSA IVEPHRARLS RNARRVRDDG FQERFVTQRC SRVGHREAAR FEHALLKALE
RAGVTFEDKE GWQPRVLSVE REFVVHDTAL SVDAGWMDAA PVRDADEVQP SVYAPDGYRQ
RPPGTYVPNT DGPEASQGKV FDPALKHLDD AYRAGEPVFT DPGLGSRWWE ANRRAMELAL
RAISGTPWRD GLMLRGSMLM PVWVGDAARR PRDLDFVVVP AETAPFGDPA DRMFADVVGA
VTDASAQGIS FDAEGVRLES IWTYERAPGR RVVVPWRAGG LPPGTVQIDV VFNESLPEPP
VAVSVAGSDV LAAGAELSLA WKVLWLYTDT YPQGKDLYDA VLLAESARPS RELLVGVLRP
ELGDRAETVD ERFLREEGSL DSDEWDDFVS DCPWVEGDAG EWVDRFEAAM APVFRGE