Gene Ndas_5249 details


Gene Information

Locus tag: Ndas_5249
Symbol:
ID: 9249146
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 410380
End bp: 411789
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: cell cycle protein
Protein accession: YP_003683135
Protein GI: 297564162
COG category:
COG ID:
TIGRFAM ID:
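
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 410380
end_bp = 411789

# Inclusive span, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last one is assumed to be a stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1410
print(protein_length)  # 469
```

Under these assumptions the computed values agree with the listed gene length (1410 bp) and protein length (469 aa).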


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.168107
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTAAGG CTTCCGAACC CGAGGCGCCC ACCGCGCTGC CACCCGTCAA GAGGCGCAAC 
GCCGAACTGG TCCTCATCCT CTTCGCCATC GTCATCACCA TGGCGGGCAT CGCGATCGCG
GGCGTCAACC TCAACGGCCA GGTCCCCGGC GCGATGTGGA CGGTGGGCCT GACCTTCGCG
GCCCTGTCCG CCGCCGCGCA CGTGGCGATG CGCTTCGTGG CGCCCTACGC CGACCCGCTG
ATCCTGCCCT GTGCGCTGTT CCTCAACGGC ATCGGCGTGG CGATGATCTG GCGGATCCAG
GCGGGCGAGG CCGAGGATAT CGAGCGCGCC GGGGTCGGCT CGCAGCTGAT GTGGACGGCG
ATCGGCCTGG TCCTGTGCTT CCTCATCATC ATCTTCCTCA AGGACCCCAG GGTCCTCCAG
CGCTACACCT ACGTCAGCGG CCTGGTCGCG ATCATCCTCC TCGCCCTGCC GATCATCCCC
GGCCTGGGCC AGGAGGTGTA CGGCGCCCGG CTGTGGATCG GCATCGGGCC GTTCACCATG
CAGCCGTCGG AGTTCGCCAA GATCGCCCTG GTGATCTTCC TGGCGTCCTA TCTGATGAGC
AAACGCCAGG TGCTCCAGAT CGTCGGCAAG CCGATCAAGA TCGGCCGCTT CACGCTCATC
GAACTGCCCC GGGCGCGCGA CCTGGCACCG ATCCTGGTCG GCTGGGTGCT GGCCATCGGC
ATGCTGGTGC TCCTGCGCGA CCTGGGCACC TCGCTGCTGC TCTTCGGCAC GTTCCTGGCG
ATGCTGTACG TGGCCACCCA GCGCTCCTCG TGGGTGACCA TCGGCCTGCT CCTGTTCGCG
GCCGGGGCGT TCGTGGCCTA CCTGCTCTTC TGGCACGTCC AGGCCCGCGT CAACATCTGG
CTCAACGCCT TCGACCAGGA GGTGTACGAG GCGGTCGGCG GCAGCCAGCA GCTGGTGGAG
GGGCTGGTGG GCATGGCCTA CGGCGGCCTC TTCGGCACCG GCATGGGCGC GGGGGCCCTG
TACGACACCT TCGCCGCCGA CAGCGACCTC ATCCTCGCCA CGATCGGCGA GGAGCTGGGC
CTGACCGGCC TGCTCGCCAT CCTCATGGTG CTCGGCCTGT TGGTGGAGCG CGGCATGCGC
ATGGCGCTCG CGACCACCGG CGCGTTCAAC AAGCTGCTGG CCAGCGGTGT CGCCTTCCTT
CTCGCCTACC AGGTCTTCAT CGTCCTGGGC GGCCTGACCC GGGTGATCCC GCTGACCGGC
TCGACCACGC CCTTCATGGC GGCGGGCGGT TCCGCGCTGC TGGCGAACTG GATCATGATG
GGGATCCTGC TCCGCATCAG TGACAACGCC AGGCGCCCAG CACCGCAGGC CATCCAGGAC
GAGGGCGCGA CACAGGTGAT CCGACGATGA
 
Protein sequence
MSKASEPEAP TALPPVKRRN AELVLILFAI VITMAGIAIA GVNLNGQVPG AMWTVGLTFA 
ALSAAAHVAM RFVAPYADPL ILPCALFLNG IGVAMIWRIQ AGEAEDIERA GVGSQLMWTA
IGLVLCFLII IFLKDPRVLQ RYTYVSGLVA IILLALPIIP GLGQEVYGAR LWIGIGPFTM
QPSEFAKIAL VIFLASYLMS KRQVLQIVGK PIKIGRFTLI ELPRARDLAP ILVGWVLAIG
MLVLLRDLGT SLLLFGTFLA MLYVATQRSS WVTIGLLLFA AGAFVAYLLF WHVQARVNIW
LNAFDQEVYE AVGGSQQLVE GLVGMAYGGL FGTGMGAGAL YDTFAADSDL ILATIGEELG
LTGLLAILMV LGLLVERGMR MALATTGAFN KLLASGVAFL LAYQVFIVLG GLTRVIPLTG
STTPFMAAGG SALLANWIMM GILLRISDNA RRPAPQAIQD EGATQVIRR
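
The gene and protein sequences above can be checked against each other by translating the DNA with translation table 11, as listed in the record. The following is a minimal sketch rather than the database's own method. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, the codon table is built from the standard amino-acid ordering that table 11 shares with the standard code, and the set of start codons is a simplification (table 11 permits additional initiation codons). A start codon at the first position is read as M, which is why the leading GTG corresponds to methionine.

```python
from itertools import product

# Codon table in TCAG order; table 11 uses the same amino-acid assignments
# as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplification: only the most common bacterial start codons are listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used in the block layout."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq, is_cds_start=True):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        # An initiation codon at the very start of a CDS is read as Met.
        if i == 0 and is_cds_start and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last 30 bases of the gene sequence listed above.
first_30 = "GTGAGTAAGG CTTCCGAACC CGAGGCGCCC"
last_30 = "GAGGGCGCGA CACAGGTGAT CCGACGATGA"

print(translate(first_30))                     # MSKASEPEAP
print(translate(last_30, is_cds_start=False))  # EGATQVIRR
```

The two fragments reproduce the first ten and last nine residues of the listed protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare with the 68% reported in the record.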