Gene Ndas_1193 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1193 
Symbol 
ID9245044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1449592 
End bp1450767 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content73% 
IMG OID 
ProductL-carnitine dehydratase/bile acid-inducible protein F 
Protein accessionYP_003679140 
Protein GI297560166 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.120907 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.40116 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCCGC TTCGCGGTGT AACCGTCGTG TCCCTGGAGC AGGCCATCGC CGCTCCCTAC 
GCCAGCCGCC ACCTGGCCGA CATGGGCGCC CGCGTCATCA AGGTCGAGCG CCCCGGCACG
GGCGACTTCG CCCGCGGCTA CGACTCCCGC GTCAACGGCA TGAGCTCGCA CTTCGTGTGG
GTCAACCGCA ACAAGGAGTC CCTGACGCTG GACATCAAGG ACCCGCGCGG CAACGAGGTG
CTGCGCCGGC TGCTGGCCCG CGCCGACGTC TTCATCCAGA ACCTGGCGCC AGGGGCCGCG
GCCCGCGCCG GGCTGGGCGC CGCCGAGCTG CACGCGCGGC ACCCCGGGCT CATCGTCTGC
GACATCTCCG GCTACGGTTC CCCCGGCCCC TACGAGACGA TGAAGGCCTA CGACCTGCTG
GTGCAGAGTG AGTCGGGGCT GCTGTCGGTG ACGGGCAGCG GCGAGGAGAT GGCCAAGGTC
GGCATCTCGG TCTCCGACAT CGCCGCGGGC ATGTACGCCT ACAGTTCGAT CCTGGGCGCG
CTGCTCGAAC GCGCGCGCAC CGGGAAGGGC GCCCACCTGG ACGTGTCGAT GCTGGAGGCC
ACCGCCGAGT GGATGGGCTT CCCGCTCTAC TACACCTACG ACGGCCAGGA GCCGCCGGCG
CGCGCCGGCG CGGCGCACGC CACGATCTAC CCCTACGGCC CCTTCGTCGC GCGCGATGAG
CAGGTGGTCC TGATGGCGAT CCAGAACGAG CGCGAATGGC GCGCGTTCTG CGAGCGCTTC
CTGGAGCGCC CCGCGTTCGC CGAGGACCCC GCCTACGCCA CCAACGCCGC CCGCAGCGCC
AACCGCGACA CGCTCAAGGC CGTCATCGAC CGGCGCTTCG CCGAGCTGGA CGGCGACGAG
GCCACCTCCC TGCTCGCCGA CGTCCCGGTC GCCTACGCCC GGGTGAACAG CCTCGCCGAC
GTGTGGAACC ACCCGCAGCT CGCCGCGCGC GGGCGCTGGC ACGAGGTGGA CACGCCGACG
GGTCGGGTCC CGGCGCTCGC GCCGCCCGGC CCGCGCGACC CGGCCCCCCG CATGGACCCG
GTCCCCGACC TCGGCGAGCA CACCGACGCC ATCCTCGGCG AACTGGGGAT GACGGCCGAG
GAGACCGGCG AGCTGCGGTC GGGCGGGGTG GTCTGA
 
Protein sequence
MQPLRGVTVV SLEQAIAAPY ASRHLADMGA RVIKVERPGT GDFARGYDSR VNGMSSHFVW 
VNRNKESLTL DIKDPRGNEV LRRLLARADV FIQNLAPGAA ARAGLGAAEL HARHPGLIVC
DISGYGSPGP YETMKAYDLL VQSESGLLSV TGSGEEMAKV GISVSDIAAG MYAYSSILGA
LLERARTGKG AHLDVSMLEA TAEWMGFPLY YTYDGQEPPA RAGAAHATIY PYGPFVARDE
QVVLMAIQNE REWRAFCERF LERPAFAEDP AYATNAARSA NRDTLKAVID RRFAELDGDE
ATSLLADVPV AYARVNSLAD VWNHPQLAAR GRWHEVDTPT GRVPALAPPG PRDPAPRMDP
VPDLGEHTDA ILGELGMTAE ETGELRSGGV V