Gene Ndas_4820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4820 
Symbol 
ID9248704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5711050 
End bp5712393 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content70% 
IMG OID 
ProductProtein of unknown function DUF2252 
Protein accessionYP_003682710 
Protein GI297563736 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.923578 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGACC GGCTCGATTC CACCGACAGC TCCGCGGAGC GCCGCGACCT CATCGTCGCC 
ACGCTGGAGA ACGCCTTCTC GGACCTGATG TCCGCCGACC CGGCGGCGTT CCGCGTCAAG
TTCCGCAAGA TGGCGGCCAA CCCGTTCGCC TTCTACCGGG GCAGCGCCGC GCTCTTCTAC
GACGACGTCT CGGGCATGGA CGACCCCTGG GCCGACGAAC GCACCTCGCG GGTGTGGATC
CAGGGTGATC TGCACGCGGA GAACTTCGGC ACCTACATGG ACTCCACCGG GCGGCTGGTG
TTCGACGTCA ACGACTTCGA CGAGGCCTAC CTCGGCCACT TCACCTGGGA CGTGCTCAGG
TTCGCGGCCA GCATCGGGGT CATGGGCTGG CAGAAGGCCC TGTCCGACGA GGACATCAGC
GCGCTCCTGC CGCACTACGT CGACGCCTAC ATCGCCCAGG TGCGCGAGTT CGCGACGACG
GGCAACGACT CGGAGTTCTC GCTCAAGCTG GGCAACACCG ACGGCACCGT GCACGACGTG
CTCCAGAAGA CCCGGCTCAA CAGCCGCGCC GAGATGCTGT CGTCCATGAC GACCCGCGAC
GGGTACACCC GCCGCTTCGC CGAGGGCCCC CGGGCCCGCC GCCTGGACGA CGCCGAGCGG
GAACGGGTCC TGGCCGCCTA CGAGGCCTAC CTGGGCACCA TCCCCGAGGA CCGGCGGTAC
GCGTCGATCA ACTACGCGGT CAAGGACGTG GTGGGCAGCG GCGGCTTCGG GATCGGCTCG
GCGGGGCTGC CCGCCTACAC CCTGCTCATC GAGGGCCAGT CGGAGGCCTG GGACAACGAC
ATCGTGCTGT CCATGAAGCA GGGCAACGTG GCGGCGCCCT CGCGTGTGGT GACCGACCAG
CGCATCATGG ACCACTTCCA GCACCACGGG CACCGCACCG CGATGTCCCA GCGGGCGCTC
CAGGCGCACG CGGACCCGCT GCTGGGCCAC ACCGAGATGG GCGGCGTGGG CTTCGTGGTG
AGCGAGGTCT CCCCCTACAC CAACGACCTG GACTGGGACG ACCTCACCGA GCCCGCGGAG
ATCGCGCCGG TGCTGGACTA CCTGGGGCGC GCCACCGCCA AGGCGCACTG CGTGTCCGAC
TCGCACGCGG ACGCCACGCT CGTGCGCGGT CAGACCGAGG AGGCGGTCAT GGCGGTGCTC
GACGGCCGCG AGGCGGAGTT CACGCGGTGG TGCGTGGACT TCGCGCACCG GTACGCGGCC
CAGACCCGGG CCGACTACTC GCTGTTCGTG GACGCCTTCC GCAACAACGC CATCAGCGCG
GTGAGGTCCA GCCGGGACCT GTGA
 
Protein sequence
MLDRLDSTDS SAERRDLIVA TLENAFSDLM SADPAAFRVK FRKMAANPFA FYRGSAALFY 
DDVSGMDDPW ADERTSRVWI QGDLHAENFG TYMDSTGRLV FDVNDFDEAY LGHFTWDVLR
FAASIGVMGW QKALSDEDIS ALLPHYVDAY IAQVREFATT GNDSEFSLKL GNTDGTVHDV
LQKTRLNSRA EMLSSMTTRD GYTRRFAEGP RARRLDDAER ERVLAAYEAY LGTIPEDRRY
ASINYAVKDV VGSGGFGIGS AGLPAYTLLI EGQSEAWDND IVLSMKQGNV AAPSRVVTDQ
RIMDHFQHHG HRTAMSQRAL QAHADPLLGH TEMGGVGFVV SEVSPYTNDL DWDDLTEPAE
IAPVLDYLGR ATAKAHCVSD SHADATLVRG QTEEAVMAVL DGREAEFTRW CVDFAHRYAA
QTRADYSLFV DAFRNNAISA VRSSRDL