Gene Ndas_1497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1497 
Symbol 
ID9245347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1835057 
End bp1836415 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content69% 
IMG OID 
Productprotein of unknown function DUF245 domain protein 
Protein accessionYP_003679433 
Protein GI297560459 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00249472 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGAACGAC GGATCTTCGG GCTGGAGAAC GAGTACGGGG TCACGTGCAC CTTCCGTGGA 
CAGCGCCGCC TGTCCCCGGA CGAGGTCGCC CGCTACCTCT TCCGCAGGGT GGTCTCCTGG
GGGCGCAGCA GCAACGTGTT CCTGCGCAAC GGCGCCCGCC TGTACCTGGA CGTGGGCAGC
CACCCCGAGT ACGCCACCCC CGAGTGCGAC AGTCTCGTGG ACCTGGTGGC CCACGACAAG
GCCGGCGAGC GCATCCTGGA GGGGCTGCAG GTCGACGCCG AGCAGCGCCT GCACGAGGAG
GGCATCGCCG GGGACATCTA CCTGTTCAAG AACAACACCG ACTCGGCGGG CAACTCCTAC
GGCTGCCACG AGAACTACCT CGTGGGACGG CACGGCGAGT TCGGACGCCT GGCCGACGTG
CTCATCCCCT TCCTGGTCAC CCGCCAGATC ATCTGCGGCG CCGGAAAGGT GCTCCAGACC
CCCCGCGGCG CCCTGTTCTG CGTCAGCCAG CGCGCCGAGC ACATCTGGGA GGGCGTCTCC
TCGGCCACCA CCCGCTCGCG CCCCATCATC AACACCCGCG ACGAGCCGCA CGCGGACGCC
GAGCGCTTCC GGCGCCTGCA CGTCATCGTC GGCGACTCCA ACATGAGCGA GACCACCAAC
CTGCTCAAGC TGGGCTCCAC CGACCTGGTG CTGCGCATGA TCGAGGCCGG GGTGGTCATG
CGCGACTACA CGCTGGAGAA CCCGATCCGG GCCATCCGCG AGGTCAGCCA CGACATGACC
GGCCGCCGCA AGGTACGCCT GGCCAACGGG CGCGAGGCCA GCGCGCTGGA GATCCAGCGC
GAGTACCTGG ACAAGGTGCA GAGCTACGTC GACCGGCACG GCACCGACGC CACCGGCAAG
CGCGTCCTGG AGCTGTGGCA GCGCACCCTG GAGGCGGTCG AGACCCAGAA CCTGGAGACC
GTCTCCCGCG AGATCGACTG GGTGGCCAAG TACCTGCTGC TGGAGCGCTA CCGCGACAAG
CACGACCTGT CCCTGTCCTC GCCGCGGGTG GCCCAGCTCG ACCTGACCTA CCACGACATC
CACCGCGACC GGGGACTGTT CTACCTGCTC CAGGGCCGCG GCCAGATGGA ACGGGTGGTC
GGCGACCTCA AGATCTTCGA GGCCAAGTCG GTGCCGCCGC AGACCACGCG GGCCCGGCTG
CGCGGCGAGT TCATCCGGCG CGCCCAGGAG CAGCGCCGCG ACTTCACGGT GGACTGGGTG
CACCTCAAGC TCAACGACCA GGCCCAGCGC ACGGTGCTGT GCAAGGACCC CTTCAAGTCG
GTGGACGAGC GGGTGGAGAA GCTCATCGCC GGTATGTAG
 
Protein sequence
MERRIFGLEN EYGVTCTFRG QRRLSPDEVA RYLFRRVVSW GRSSNVFLRN GARLYLDVGS 
HPEYATPECD SLVDLVAHDK AGERILEGLQ VDAEQRLHEE GIAGDIYLFK NNTDSAGNSY
GCHENYLVGR HGEFGRLADV LIPFLVTRQI ICGAGKVLQT PRGALFCVSQ RAEHIWEGVS
SATTRSRPII NTRDEPHADA ERFRRLHVIV GDSNMSETTN LLKLGSTDLV LRMIEAGVVM
RDYTLENPIR AIREVSHDMT GRRKVRLANG REASALEIQR EYLDKVQSYV DRHGTDATGK
RVLELWQRTL EAVETQNLET VSREIDWVAK YLLLERYRDK HDLSLSSPRV AQLDLTYHDI
HRDRGLFYLL QGRGQMERVV GDLKIFEAKS VPPQTTRARL RGEFIRRAQE QRRDFTVDWV
HLKLNDQAQR TVLCKDPFKS VDERVEKLIA GM