Gene Ndas_5227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5227 
Symbol 
ID9249120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp379845 
End bp380939 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content76% 
IMG OID 
ProductL-carnitine dehydratase/bile acid-inducible protein F 
Protein accessionYP_003683113 
Protein GI297564140 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.012301 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACCCC TGAACGGAAT CCGCGTCGTC GAGTTCACCG GGATCGGCCC GGCTCCGATG 
GCCGGGATGC TCCTCGCCGA CCTGGGCGCC AGCGTCATCC GCCTCGACCG CCCCGCGGCC
GCCGACGCGA TGAACTCCGG AGCGGGCGGA CCCCACCTGA GCGAGGGGCG CACCGTCCTG
CCCGCCGACC TCAAGTCCGA GGAGGGCCTC GCGCTCGCCC GCGACCTGGC CTCCCGCGCC
GACGTGCTCC TGGAGGGCTT CCGCCCCGGG GTCATGGAGC GCCTGGGCCT GGGCCCCGAG
GTCTGCCTGG AGGCCAACCC GCGCCTGGTC TACGCGCGGG TCACCGGCTG GGGCCAGGAC
GGGCCGCTCG CCCGCGCCGC CGGGCACGAC ATGAACTACA TCTCCGTGAA CGGCGCCCTG
CACGCCATCG GGCGCGCGGG GGGCCCGCCG GTCCCCCCGG TCAACCTCCT GGGCGACTTC
GCCGGGGGCA CCATGTTCGC GGTCACCGGC GTCCTCGGCG CGCTGGTCGA GCGCCAGTCC
TCCGGTCGCG GCCAGGTCGT GGACGCCGCG ATGGTGGACG GCAGCGCGCT GCTGATGTCG
ATGCTGCACG AGGACCGCGC GCGCGGTTCC TGGAGCGACG AGCGCGGCAC CAACTACCTG
GACACCGGCG CCCCCTGGTA CGACGTCTAC GAGTGCGCCG ACGGCCGGTA CGTGTCGGTC
GGCTGCATCG AGCCCCAGTT CTACGCCGCG TTCCTGGAGG GCACCGGCCT GGCCGGGGAG
GACCTGCCCG ACCAGTGGGA CCGGTCGGGC TGGCCCCGGC TGCGGGAGCG GTTCGCCGAG
GTCCTGCGCA CCCGTACCCG CGACGCGTGG GGCGAGGTGT TCGAGGGCGC CGACGCCTGC
GTGATGCCCG TCCTGTCCCT GGAGGAGGCC CCCGACCACC CGCACGTGCG CGCCCGCGGC
TCACTCGTGC GCGACGGCGA CCGGATCCTG TCGGGCCCGG CCCCGCGCTT CGACCGCACC
CCGGGCGGGG TCACCCGGGG CGCCGAGCAG CCCGACACCG CGCGGACCCT CCGGGAGTGG
GGCCTCACCC CCTGA
 
Protein sequence
MGPLNGIRVV EFTGIGPAPM AGMLLADLGA SVIRLDRPAA ADAMNSGAGG PHLSEGRTVL 
PADLKSEEGL ALARDLASRA DVLLEGFRPG VMERLGLGPE VCLEANPRLV YARVTGWGQD
GPLARAAGHD MNYISVNGAL HAIGRAGGPP VPPVNLLGDF AGGTMFAVTG VLGALVERQS
SGRGQVVDAA MVDGSALLMS MLHEDRARGS WSDERGTNYL DTGAPWYDVY ECADGRYVSV
GCIEPQFYAA FLEGTGLAGE DLPDQWDRSG WPRLRERFAE VLRTRTRDAW GEVFEGADAC
VMPVLSLEEA PDHPHVRARG SLVRDGDRIL SGPAPRFDRT PGGVTRGAEQ PDTARTLREW
GLTP