Gene Ndas_1116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1116 
Symbol 
ID9244966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1369603 
End bp1370787 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content75% 
IMG OID 
Productmannose-6-phosphate isomerase, class I 
Protein accessionYP_003679063 
Protein GI297560089 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0922592 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACAGGC TCACCAACCA GGTAAGGCCT TACGCCTGGG GAGCGCGCAC GGCCATCCCG 
CGGCTGCTGG GCGCCGAACC CGACGGCACG CCCCAGGCCG AACTGTGGCT GGGGGCCCAC
CACGGCGCGC CCAGCACGGC GCACTGCGAG GACGGTCCGC GCCCCCTGCC CGGCCTCATC
GCCGACAACC CCGACCGCGT CCTGGGGCGG CGCACCGCCG AGCGCTTCGG CGGGCGGCTG
CCCTTCCTCC TCAAGGTCCT GGCCGCCGAG GCGCCCCTGT CCCTCCAGGT CCACCCCGAC
GCCGTCCGCG CCCGCGCCGG GTTCGAGGCC GAGGAACGCG CGGGCATCCC CCTGGACGCC
CCCCACCGCA ACTACCGCGA CCCCCACCAC AAGCCCGAAC TCCTCCTCGC CCTGGAGCCC
TTCGAGGCGC TGTGCGGCTT CCGCGAACCC GCCGCCGCCC GCGCCGACCT GCGCGGACTC
ACCTGCGAAC TGGCCGTGGC GCTGCGCGGC GACCTCGCCC TGACCGACGC CGGAACCGCG
CTGCGCGGCG CCCTCACCCG CCTGCTCACC CTCACCGAGG GCGAGCGCGC CCGGCTGCTG
GACGACTTCG TGCGCGAGTG GTCGGACTCC GGCCCGCGCG GATCCCACGG CGCCATCGTC
GCCGACCTGG CCGAGCGCTA CCCGGGCGAC CCCGGCGCGG TGGCCGCCCT GCTGCTCAAC
CGGGTCACGC TCTGGCCGGG CCAGGCCCTG TTCCTGCCCG CGGGGAACAT GCACGCCTAC
CTCCAGGGCA CCGCGGTCGA GGTGATGGCC AGCTCCGACA ACGTGCTGCG CGCCGGGCTG
ACCGGCAAGC ACGTGGACGC GCCCGAACTG CTGGACGTGG TGGACTTCTC GGTGCTGCCC
ATCCCCTACG CCAGACCCGG GGTCTGCGAG GGGCGCCGGG AGTTCCGCAC GGCCGCCCCG
GAGTTCGCGC TGCACGAGAT CGGCCCCGGC CGCATCACGG CCCACCTGCC GGGGGAGGGG
CCGACCGTGC TGCTCGCCCT GCACGGGCAG GTGGAGTTGG TCGCCGAGGT CGGTCAGGGG
ATGACCCTCC AGCGCGGTGA GTCGGTGTTC GTGCAGGCCG ACAGCGGACC GATCAAGGTC
GAGGGCCACG GCCACGTCAT CGCCGCCACC GTCGGCGATA TCTGA
 
Protein sequence
MHRLTNQVRP YAWGARTAIP RLLGAEPDGT PQAELWLGAH HGAPSTAHCE DGPRPLPGLI 
ADNPDRVLGR RTAERFGGRL PFLLKVLAAE APLSLQVHPD AVRARAGFEA EERAGIPLDA
PHRNYRDPHH KPELLLALEP FEALCGFREP AAARADLRGL TCELAVALRG DLALTDAGTA
LRGALTRLLT LTEGERARLL DDFVREWSDS GPRGSHGAIV ADLAERYPGD PGAVAALLLN
RVTLWPGQAL FLPAGNMHAY LQGTAVEVMA SSDNVLRAGL TGKHVDAPEL LDVVDFSVLP
IPYARPGVCE GRREFRTAAP EFALHEIGPG RITAHLPGEG PTVLLALHGQ VELVAEVGQG
MTLQRGESVF VQADSGPIKV EGHGHVIAAT VGDI