Gene Ndas_4224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4224 
Symbol 
ID9248098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5042228 
End bp5043334 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content68% 
IMG OID 
ProductPTS system, mannitol-specific IIC subunit 
Protein accessionYP_003682122 
Protein GI297563148 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.110658 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACAC AGCAGAGCAC GGACAGACTC GCCTCGGTGC GCTCCGGAGT ACAGCGCTTC 
GGAGGGTTCC TGTCGAGCAT GGTGATGCCC AACATCGGCG CGTTCATCGC CTGGGGCCTG
ATCACCGCCC TGTTCATCCC CGACGGGTGG TGGCCCAACG AGCAGATGGC CGGGCTCGTC
GACCCGATGA TCAAGTACCT GCTGCCGCTG CTGATCGCCT ACACCGGCGG CGCGCTGGTG
CACGACAGGC GCGGCGGCGT GGTCGGCGCG GCCGCGACCA TGGGCGTGAT CGTCTCCGCG
GACATCCCCA TGTTCCTGGG CGCGATGTTC ATGGGTCCGT TCGCGGCCTA CCTCATGAAG
CACTTCGACC GGGTCGTCCA GCCGCGCATC AAGGCCGGCT TCGAGATGCT GGTCAACAAC
TTCAGCGCCG GCATCCTCGC CGCGATCCTG GCCGCCCTGG GCGTCTACGC GGTCGGACCG
GTCGTGGAGG GCATCGCCAC CGGCCTGGGC AAGGGCGTGC AGTTCCTCAT CGACCTGAGC
CTGCTGCCGC TGGTCTCGGT CATCGTCGAG CCCGCCAAGG TGCTGTTCCT CAACAACGCC
ATCAACCACG GCGTCTTCAC CCCGCTGGGC ACGGCCCGCG CGGTCGCCGA CGGCAGGGCC
ATCGAGTTCC TCATCGAGTC GAACCCCGGA CCGGGCCTGG GCATCCTGCT GGCCCTGATG
TTCTTCGGCT CCAAGGTCAG CCGCGCCACC GCGCCCGGCG CGGCCGTCAT CCACTTCTTC
GGCGGGATCC ACGAGATCTA CTTCCCGTAC ATCCTCGCCC AGCCGAAGCT GATCCTCGCC
GCGATCGGCG GCGGTATGTC CGGCGTCGCG ACCTTCATGA TCATGGACGC CGGGCTCGTC
TCCGCCGCCT CCCCCGGCAG CATCATCGCG ATCATGGCGG TCACCCCGCA GGGAGGCCAC
CTGTCGGTCC TGGCCGGGGT CGTCGCCGCC ACCATCGTCT CCTTCGTCAT CGCCTCGCTC
CTGCTCGGCT TCGGCCGGTC CGAGCGCAAG GCCGAGCGCG AGGAGAAGGC CAAGCAGGAA
GCCGCTCAGA ACCAGGAGAA CAGCTGA
 
Protein sequence
MTTQQSTDRL ASVRSGVQRF GGFLSSMVMP NIGAFIAWGL ITALFIPDGW WPNEQMAGLV 
DPMIKYLLPL LIAYTGGALV HDRRGGVVGA AATMGVIVSA DIPMFLGAMF MGPFAAYLMK
HFDRVVQPRI KAGFEMLVNN FSAGILAAIL AALGVYAVGP VVEGIATGLG KGVQFLIDLS
LLPLVSVIVE PAKVLFLNNA INHGVFTPLG TARAVADGRA IEFLIESNPG PGLGILLALM
FFGSKVSRAT APGAAVIHFF GGIHEIYFPY ILAQPKLILA AIGGGMSGVA TFMIMDAGLV
SAASPGSIIA IMAVTPQGGH LSVLAGVVAA TIVSFVIASL LLGFGRSERK AEREEKAKQE
AAQNQENS