Gene Ndas_5521 details


Gene Information

Locus tag: Ndas_5521
Symbol:
ID: 9249424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 711937
End bp: 713751
Gene Length: 1815 bp
Protein Length: 604 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: TRAG family protein
Protein accession: YP_003683406
Protein GI: 297564433
COG category:
COG ID:
TIGRFAM ID:
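
The gene and protein lengths listed above follow from the start and end coordinates. The short Python check below is only a sketch: it assumes the coordinates are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein. Neither assumption is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 711937
end_bp = 713751

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1815 604
```

Both results agree with the Gene Length (1815 bp) and Protein Length (604 aa) fields.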


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.197372
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.646541
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACGGC GCAGACCCAC GTACCAGGTG CGCGGCGGAG CGGCGGCGGC CGGAGCCCCG 
CCGCTGCTCG TGCTGCTCGC CCTGTGGGGC CTGGTGGCCC TGTTCTTCCT CTTCTGGCTG
TCGGCGCGGC TCGTGGCCGC CGTAACCGGG GGCGCGGTCG GCGAGTTCGG CCCGATGTGG
GCGTACTCGC TGCTGGTCTG GGACACCGGG GCGACCTGGC CGGGCACGCC GAGCGCCCCG
GTCGCCGTGG TGTTCTCCGT CCTGGCCGCG GCTGCGGTCA GCGCCCTCTG GTGGATCCTG
GTCCGGGTCA GGAACGCGCT GTACGGACAG CCCGACGCGG TCGCCGCCCT GAACAGGAAC
AACGAGACGA TCGCCGAGCT GGCCCGCCCC GAGGCCGCGC AGAAGGCGGT CCGGCTGCGC
GGACGTTCGC TGCGCGGCAT CAGGAGCGGG ACGCTCGCCG ACGCCGACGT CGGCCTGGTC
ATCGGCGACG TGCGGGGGTC GGGCAGACGC GACGGGCCCC GGCTGTTCGC GTCGTGGGAG
GACACCGTCC TCGCCTACAT GGCGCCCCGC GCGGGCAAGA CCACGGCGAT GGCCATCCCC
TACGTCCTGG ACGCGCCCGG TCCTGCGCTG GCCACCAGCA ACAAGGCCGA CGTCTGGTCG
GCCACGGCGA AGATCCGGGA GCAGGCCACC GGCGACCGGG TCTGGCTGTT CGACCCCCAG
CACATCACCC ACCAGGAGCA GGACTTCTGG TGGAACCCGC TGGCCGGTGT GCGCAGCGTC
GAGGACGCCT ACCGGCTCGC CGGGCACTTC GTCCTCACCA TCGACGACGA CTCCAAGAAG
GACATGTGGG GTCCGGCGGC CCGGGCTCTG CTCTCCCAGC TCATGCTCGC CGCGGCGCTC
GGGGGCGAGT CCCTGGCCAG GGTCGGGGAG TGGCTGCACG ACACCAAGCT GCCCCAGCCG
GTGGACATCC TCTTCGAGCA CGGGTTCACG GCCTACGCCG AGGCACTGCG CGAGACGCAG
AACATCGTCG CCGAGACGCG TGACGGCATC TACACGACCG CACGCACCGC CGCCCGCTGT
CTGGACGACC CCGAGATCAT GGCGTGGGTG ACCCCGCCGG ACGGCTCGGC TTACCACGAG
TTCGACCCGC GCTCCTTCGT CACCACCAAA CAGACGCTCC ATCTGCTCAG CAAGTCCCGC
GCCGCCGCGG CCCCGCTCAT CGCCGCCCTC ACCGACGCGG TCTTCATCGC GGGAGAGGAG
GCGTCGGAGG GGCAGGGCGG CAGGCTCGAC CCGCCGCTGG TGGCCGTGCT GGACGAGGCC
GCCAACATCT GCAAGATCGC GGACCTGCCC GACATGTACT CCCACCTGGG ATCGCGCGGC
ATCGTCCCGG TCACCATCCT CCAGAGCTAC CGGCAGGGGG TGCGGGTCTG GACCGAGAAC
GGCATGGAGG CCATGTGGTC CGCCGCCACG GTGAAGGTGT TCGGCGCCGG GCTCGACGAC
CACAAGATCG TGGACGCGCT GTCCAAGCTG ATCGGCCAGC ACGACATCTC CACCACGTCG
TTCAGCTACG GCGAGGGCAA GGGCAACCAC TCCGTCCAGC TCCGGCGCCA GGAGATCATG
CAGGGCTCCG ACATCCGCAG GATCGACAAG GGCGAGTGCC TGCTCTTCGC CACGGCCGCC
CAGCCCACGA TCCTGCGGAT GCGCCCCTGG TACAGGACCG ACAGGGCCAG GATCGTCTCC
GCCGCGATCA AGGAGGCCGA GGAGCGGATC ACCACCAGGG CCCGCGTGCG CTACGACGCG
CCCCGCGGAA GGTGA
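
The GC content field can be recomputed from a gene sequence formatted as above. The following is a minimal Python sketch, not the database's own method: the function name `gc_fraction` is hypothetical, and it assumes the sequence contains only A, C, G and T plus the spaces and line breaks used for layout.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (the 10-base grouping and line breaks) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First line of the gene sequence shown above.
first_line = "ATGGCACGGC GCAGACCCAC GTACCAGGTG CGCGGCGGAG CGGCGGCGGC CGGAGCCCCG"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full 1815 bp sequence instead of the first line gives the value to compare against the GC content field.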
 
Protein sequence
MARRRPTYQV RGGAAAAGAP PLLVLLALWG LVALFFLFWL SARLVAAVTG GAVGEFGPMW 
AYSLLVWDTG ATWPGTPSAP VAVVFSVLAA AAVSALWWIL VRVRNALYGQ PDAVAALNRN
NETIAELARP EAAQKAVRLR GRSLRGIRSG TLADADVGLV IGDVRGSGRR DGPRLFASWE
DTVLAYMAPR AGKTTAMAIP YVLDAPGPAL ATSNKADVWS ATAKIREQAT GDRVWLFDPQ
HITHQEQDFW WNPLAGVRSV EDAYRLAGHF VLTIDDDSKK DMWGPAARAL LSQLMLAAAL
GGESLARVGE WLHDTKLPQP VDILFEHGFT AYAEALRETQ NIVAETRDGI YTTARTAARC
LDDPEIMAWV TPPDGSAYHE FDPRSFVTTK QTLHLLSKSR AAAAPLIAAL TDAVFIAGEE
ASEGQGGRLD PPLVAVLDEA ANICKIADLP DMYSHLGSRG IVPVTILQSY RQGVRVWTEN
GMEAMWSAAT VKVFGAGLDD HKIVDALSKL IGQHDISTTS FSYGEGKGNH SVQLRRQEIM
QGSDIRRIDK GECLLFATAA QPTILRMRPW YRTDRARIVS AAIKEAEERI TTRARVRYDA
PRGR
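
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The Python sketch below illustrates this under stated assumptions: it builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G), which translation table 11 shares with the standard code for internal codons, and it stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative, and special handling of alternative start codons is omitted because this gene begins with ATG.

```python
# Codon table in NCBI order: first, second and third base each cycle through TCAG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace used for layout is ignored. Alternative start codons
    (for example GTG or TTG in table 11) are not converted to M here.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
print(translate("ATGGCACGGC GCAGACCCAC GTACCAGGTG CGCGGCGGAG CGGCGGCGGC CGGAGCCCCG"))
# prints: MARRRPTYQVRGGAAAAGAP
print(translate("CCCCGCGGAA GGTGA"))
# prints: PRGR
```

The two outputs match the start and end of the protein sequence listed above. The last line of the gene sequence can be translated on its own because the 1800 bases before it are a multiple of three.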