Gene Ndas_5022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5022 
Symbol 
ID9248911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp162939 
End bp164216 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content73% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003682909 
Protein GI297563936 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACCCA GCCGCCCGCT GCTCGCCCTG ACGATGGGCA CGCTGCTCCT ACTCCCCGCC 
GGGGCCGCGG CCGCGGCGCC CGCCTCCCCC GCCGCAGAGG CCGACCACGC GCGCGTGGTC
AGCGAACAGC CCGTGCAGTG GACGCCGCAC GTCCTGGACG GCGCGGTCAA GGACATCCTG
CGCGTCGGCG ACACCATCCT CGTCGCCGGA AGCTTCACGC GGGTCGGCCA GACCGAGGGG
GGCCGCGCCC ACGACCTGCC CCACCTGTTC GCGTTCGAGC ACGGCACCGG TCGGATCCTC
CACGGGTTCG AACCGGAGGT GGACGGCACC GTCACCACGC TGGCCCCCGG CCCCGACGGG
ACGGTGATCG CGGGCGGGGA CTTCGGCTCG GTGGACGGCG AGCCCGCCGA CGGGCTGGCC
CGGCTGTCGG TGGACGACGG CGACCCGGTG CCGGAGTTCG GCGCCGTCGT CGACGGTGGC
CGCGTCCAGC GCATCGCCAG CGACGGCGAG CACCTGTACG TGGGCGGCAG CTTCTCGGGG
GTGAACGGCG TGGAGCAGCC CACGCTGGTC CGTCTCGACT CCGGGTCCGG CGAGGTGGAC
ACCGGCTTCA CACCCGACGT GTCCGACGCG CGCCAGGGCG TGCTGAAGGT CCAGGAGCTG
GCGCTGAGCC CGGACGGGCG GCGGCTGGCC GTCAACGGCT CCTTCACCAG GATCGACGGT
CACGAGCGGC ACCAGATCGC CATGATGGAC ACCGCGAGCG GCTCGGTCAC ACCGTGGTCC
ACGTCCGCCT ACGAGGAGCC CTGCGACTAC GAGGAGCTGC ACACCTACAT GCGGCGGATG
GCCTTCTCCC CGGACGGCTC CTACTTCGCG GTGGTGACGG CGGGCGGCCC GTACGTCAGG
CCGGGCCTGT GCAAGTCCGT CGCGCGCTTC GAGAACACCG ACACCCCGGG CTCGGAGCCC
ACCTGGAGCA ACAAGACCGG CGGCGACTCG CTGTACTCGG TGGAGATCAC GTCGGCGGCG
GTGTACGTGG GCGGACACCA GCGCTGGATG GACAACCCCG AGGGCGCGCT CAACCCGGGG
CCCGGCTCGG TGGCGCGCGA GGGCATCGCG GCCGTGGACC CCGAGACCGG CAAGGCCCTG
CCGTGGAACC CCGGCCGGGC GCGCGGCCAC GGCGTGGAGG CGATGCTGGC CACCTCCGAC
GGGCTGTACG TGGGCAGCGA CACCGAGCGC CTCGCCGACG AGTACCACGC CCGGCTGGGG
ATGTTCCCGC TCTCCTGA
 
Protein sequence
MKPSRPLLAL TMGTLLLLPA GAAAAAPASP AAEADHARVV SEQPVQWTPH VLDGAVKDIL 
RVGDTILVAG SFTRVGQTEG GRAHDLPHLF AFEHGTGRIL HGFEPEVDGT VTTLAPGPDG
TVIAGGDFGS VDGEPADGLA RLSVDDGDPV PEFGAVVDGG RVQRIASDGE HLYVGGSFSG
VNGVEQPTLV RLDSGSGEVD TGFTPDVSDA RQGVLKVQEL ALSPDGRRLA VNGSFTRIDG
HERHQIAMMD TASGSVTPWS TSAYEEPCDY EELHTYMRRM AFSPDGSYFA VVTAGGPYVR
PGLCKSVARF ENTDTPGSEP TWSNKTGGDS LYSVEITSAA VYVGGHQRWM DNPEGALNPG
PGSVAREGIA AVDPETGKAL PWNPGRARGH GVEAMLATSD GLYVGSDTER LADEYHARLG
MFPLS