Gene Ndas_0845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0845 
Symbol 
ID9244690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1037680 
End bp1038975 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content76% 
IMG OID 
ProductL-carnitine dehydratase/bile acid-inducible protein F 
Protein accessionYP_003678795 
Protein GI297559821 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0637303 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGGCG ACCTGCGCGC GGACGGGACC CCCGCCGGGC CCCGGACCGG AACCGGGAGC 
GCGACCGGGG CCGCGCCGCT GCCGCTGGCG GGCGTGCGCG TCGCCGACCT GTCCCGCGTG
CTGGCCGGCC CCTACGCCAC CATGCTGCTC GCCGACATGG GCGCCGAGGT GGTCAAGGTG
GAGCAGCCCG GACGCGGCGA CGACACCCGT TCCTGGGGCC CGCCCTGGGC CGGGGAGGCC
GGACCGGAGG GGCACGGCGA GGCCGCCTAC TTCCTGTCGG TGAACCGCAA CAAGCGCAGT
CTGGCCGTGG ACCTGAAGGA CCCCGAGGGC CTGGCCGCGG TCCGGGAGCT GTGCGCGGCC
TCCGACGTGG TCGTCCAGAA CTTCCGGCCC GGGGTGATCG ACCGGCTCGG GCTCGGCTAC
GAGGCCGTCA GCGCCCGCAA CCCGGCCGTC GTGTACTGCT CGGTGAGCGG GTTCGGACCC
GAGCACGAGC CCGCGACGCG CCCCGGCTTC GACATCGTGG TGCAGGCCGA GAGCGGGCTC
ATGGCCGCCA CGGGGCCCGC GGAGGGACCG GCGAGCAAGG TGGGCGTGGC CCTGACCGAC
GTGCTCACCG GGCTCAACGC AGCCGTGGGC GTCCTGGGCG CGCTCATGCG GGCCCGCGTC
ACCGGGCGCG GCGAGAACAT CAGCGTGTCG CTGATCAACT CCACCCTCTC CGGCCTGGTC
AACCTCACCC AGCAGGCGCT GGTGACGGGG GCCGAACCCG CCCGGGTCGG CAACGCGCAC
ACCACGATCG TGCCCTACCA GACCTTCGCG ACGGCGGACG CCGAGATCGT CGTCGCCGCC
GGGAACGACG CCCTGTACCA GCGGCTGTGC GCGGCCCTGG ACCGCCCCGA CCTGGGCGCC
GACCCGCGCT ACGCCACCAA CCCCGGCCGG GTGGCGCACC GGGAGGAGCT GGTCGGCGAG
CTGACGGCGA CCCTGCGCTC GCGGCCCGCC GACCACTGGA TGGAGCTGCT GGTGGGAGCC
GGGGTACCGG TGGGACGGGT GCGCGGAGTG CTCGACGCGC TGCGCGCCGC CGACGCCAGC
GGCGACGACG TCCTGCGCAC CGTCAAGCAC CCCACCGCCG GGCTGATCGA ACAGGTCCGC
GCGGGGTTCC GGCTGGAGGG CACCCCGCCG CCGCTCGGCG CGCCGCCGCT GCTGGGGCAG
CACTCCCGGC AGATCCTGGC CGAACTCGGC GTGGACGGGG CCGACGTGGA CGCGATGGTC
GCGCGCGGCG CCGTCCAGCA GCCCGACCTG TCCTGA
 
Protein sequence
MTGDLRADGT PAGPRTGTGS ATGAAPLPLA GVRVADLSRV LAGPYATMLL ADMGAEVVKV 
EQPGRGDDTR SWGPPWAGEA GPEGHGEAAY FLSVNRNKRS LAVDLKDPEG LAAVRELCAA
SDVVVQNFRP GVIDRLGLGY EAVSARNPAV VYCSVSGFGP EHEPATRPGF DIVVQAESGL
MAATGPAEGP ASKVGVALTD VLTGLNAAVG VLGALMRARV TGRGENISVS LINSTLSGLV
NLTQQALVTG AEPARVGNAH TTIVPYQTFA TADAEIVVAA GNDALYQRLC AALDRPDLGA
DPRYATNPGR VAHREELVGE LTATLRSRPA DHWMELLVGA GVPVGRVRGV LDALRAADAS
GDDVLRTVKH PTAGLIEQVR AGFRLEGTPP PLGAPPLLGQ HSRQILAELG VDGADVDAMV
ARGAVQQPDL S