Gene Ndas_2381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2381 
Symbol 
ID9246231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2829949 
End bp2831982 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content74% 
IMG OID 
Productrhamnulose-1-phosphate aldolase/alcohol dehydrogenase 
Protein accessionYP_003680308 
Protein GI297561334 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.932979 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGATG TCGTCGAACA GCTCCTCGCC CGCAGCAACA CCCTCGGCGC CGACCCGCGC 
AACACCAACT TCGCCGGGGG CAACACCTCC GCCGCGGACA CCCGCACCGA CCCCGTCACC
GGACAGGACG TCGACCTGCT CTGGGTCAAG GGCTCCGGCG GCGACCTGGG CACCCTCACC
GAGGACGGCC TGGCCGTGCT GCGCCTGGAC CGCCTGCGCG CCCTCGTGGA CGTCTACCCC
GGCGAGGACC GCGAGGACGA GATGGTCGCC GCCTTCGACC ACTGCCTGTT CGGCAGGGGC
GGCGCCGCGC CCTCCATCGA CACCGCCATG CACGGCCTGC TGCGCGCCGC GCACGTGGAC
CACCTGCACC CCGACTCCGG CATCGCCCTG GCCACCGCCG CGGACGGCGA GCGCCTGACC
CGCGAGTGCT TCGGCGACCG CGTGGTGTGG GTGCCCTGGC GCCGTCCCGG GTTCCAGCTC
GGCCTGGACA TCGCCCGTAT CGCCGAGGAG AACCCCGACG CCATCGGCGC GATCCTGGGC
GGCCACGGCA TCACCGCCTG GGCCGAGACC AGCGAGCAGT GCCAGGCCAA CTCCCTGGAG
ATCATCCGCA CCGCCGAGGG GTTCTTGGAG GAGAACGGCC GCCCCGAGCC CTTCGGCCCC
GTCCTGGAGG GCTACGGCGC CCTGCCCGAG GCCGAGCGCC GCCAGCGCGC CGCCGCCCTG
GCCCCGGTCA TCCGCGGGCT GGCCTCCACC GACCACCCCC AGGTGGGCCG CTTCACCGAC
AACGACGTGG TCCTGGACTT CCTGGCCGGG GCCGAGCACC CCCGCCTGGC CGCGCTGGGG
ACCTCCTGCC CCGACCACTT CCTGCGCACC AAGGTCCGGC CCCTGGTGCT CGACCTGCCC
GCCGACGCCC CGCTGGAGCG GGCCGTGGAG CGCCTGCGCG AACTGCACGG GGAGTACCGG
GCCGAGTACC GCGCCTACTA CGAGCGCCAC GCCGACGCCG ACAGCCCCGC CATGCGCGGC
GCCGACCCGG CGATCGTGCT GGTCCCCGGG GTGGGCATGT TCTCCTTCGG CAAGGACGCC
AAGACCGCGC GCGTGGCGGG CGAGTTCTAC GTCAACGCGA TCAACGTGAT GCGCGGCGCC
GAGTCCGTCT CCACCTACCG GCCCATCGAG GAGTCGGAGA AGTTCCGCAT CGAGTACTGG
GCGCTGGAGG AGGCCAAGCT CGCCCGCCTG CCCGAGCCCA AGCCGCTCGC CGCCCGGGTC
GCCCTGGTCA CGGGCGCGGC CAGCGGTATC GGCAAGGCCA TCGCCGCCCG CCTGGCGCGC
GAGGGCGCCT GCGTGGTCGT GGCCGACCTG GACGCCGACA GGGCGGCCGC CGCCGCGGCC
GAACTGGGCG GCTCCGACAC GGCCGTGGGC GTGGCCTGCG ACGTCAGCGA CGCGGACGCG
GTGGCCCGCG CCTTCGCCGC GGCGGCCCTG GCCTTCGGCG GCGTGGACCT GGTGGTCAAC
AACGCCGGGC TGTCCATCTC CAAGCCGCTG CTGGAGACCA GCGAGCGCGA CTGGGACCTT
CAGCACGACG TCATGGCCAA GGGGTCCTTC CTGGTCTCGC GCGAGGCGGC CAGGACGATG
ACCGCCCAGG GCATGGGCGG CGACATCGTC TACATCGCCT CCAAGAACGC CGTGTTCGCC
GGTCCCAACA ACGTCGCCTA CTCCGCGGTC AAGGCCGACC AGGCCCACCA GGTGCGGCTG
CTGGCCGCCG AACTGGGCGG CGAGGGAATC CGGGTCAACG GCGTCAACCC CGACGGGGTG
GTGCGCGGCT CGGGCATCTT CGCCGGGGGC TGGGGCGCCC AGCGGGCCAA GGTGTACGGG
GTCAGGGAGG AGGACCTGGG CGCGTTCTAC GCCCAGCGCA CCATCCTGGG CCGCGAGGTG
CTGCCCGAGC ACGTGGCCAA CGCGGTGTTC GCGCTGACCG CGGGCGAGCT GTCGCACACC
ACCGGCCTGC ACATCCCCGT GGACAGCGGC GTCGCCGCGG CCTTCCTGCG ATGA
 
Protein sequence
MSDVVEQLLA RSNTLGADPR NTNFAGGNTS AADTRTDPVT GQDVDLLWVK GSGGDLGTLT 
EDGLAVLRLD RLRALVDVYP GEDREDEMVA AFDHCLFGRG GAAPSIDTAM HGLLRAAHVD
HLHPDSGIAL ATAADGERLT RECFGDRVVW VPWRRPGFQL GLDIARIAEE NPDAIGAILG
GHGITAWAET SEQCQANSLE IIRTAEGFLE ENGRPEPFGP VLEGYGALPE AERRQRAAAL
APVIRGLAST DHPQVGRFTD NDVVLDFLAG AEHPRLAALG TSCPDHFLRT KVRPLVLDLP
ADAPLERAVE RLRELHGEYR AEYRAYYERH ADADSPAMRG ADPAIVLVPG VGMFSFGKDA
KTARVAGEFY VNAINVMRGA ESVSTYRPIE ESEKFRIEYW ALEEAKLARL PEPKPLAARV
ALVTGAASGI GKAIAARLAR EGACVVVADL DADRAAAAAA ELGGSDTAVG VACDVSDADA
VARAFAAAAL AFGGVDLVVN NAGLSISKPL LETSERDWDL QHDVMAKGSF LVSREAARTM
TAQGMGGDIV YIASKNAVFA GPNNVAYSAV KADQAHQVRL LAAELGGEGI RVNGVNPDGV
VRGSGIFAGG WGAQRAKVYG VREEDLGAFY AQRTILGREV LPEHVANAVF ALTAGELSHT
TGLHIPVDSG VAAAFLR