Gene Ndas_3971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3971 
Symbol 
ID9247842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4749604 
End bp4751352 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content75% 
IMG OID 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_003681874 
Protein GI297562900 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.137474 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGGCGC GTCCCCTCGT GGCGCGCGTG ACCGCCGCGG CGGAGCGGGA GCGGGCGGGA 
CTGCGCCGCG TGCTGCGCGG CGGGGCGGTC AACATGGCGG GGGCCGTGGT CGGCGCCGCG
CTGAACCTCG CCGTGATCGT GACGATCACT CGGGCGTTCT CCCAGGAGAC GGCGGGGCTG
CTGTTCTCGG CGACGTCAGT GTTCCTGATC GCCGCGGTGG TGGCCAACCT GGGCGCGTCG
GACGGGTTGG TGTACTTCAT CGCCCGGATG CGCGTGTTCG GCGAACCCGG GGGCGTGCCC
CGGCTGCTGC GGACGGCCGC GGCCCCGGCC GTGCTGGCGG CGTGCGCGCT GGCGGTGCTG
CTGGTGGTGT GCGCCGGTCC GGTGGCGCGG GGCCTGGGCG GCGGCGAGGC GGAGGTCTAC
CTGCGGCTGC TGGCGGTGTT CCTGCCGTTC GCGGTGCTGG CGGACACGGC CCTGGCCGCG
ACGCGGGCCC ACCACGACAT GGCCGGGACC GTGCTGGTGG ACAAGGTGGG CCGCCCGCTG
GCCCAGCTGG CCCTGGTGAC GGGGGTCGCG CTGTCGGGGG CGGCGGGGCT GTTGGCGCTG
GCGTGGGCCG GGCCGTACCT GCCCGCGGCC GTGGTGGCGT GGTTCTGGTT GGGACGTGTC
GTGCGCCGGG CCTTTCCCGA GGCCGGTGGG GCCTCCGGGG ATGCGGACGG ATCCGGGGGT
GCGCACGGTT CCAGGGGTGC GGATGCCCTC GGGAAGACGC ACGGCTCCGG GGATGCGGAT
GCCCTCGGGA AAGCGGCGCC CGTGGGGAAG CGGGGGACTT ATGGTGCCCC CAGTGCCACT
GGGACGGCTG TGGATTCGGA GATCGGCCGC GCCCGTGAGG AGTCCGCGAC CACGGAGACG
GCCGTGGCTC CGGAGGCCGT GGAGGAGAGC GTGGAGCGGG TGGAGGCGCG GACGTTCTGG
GCCTTCTCCC TGCCGCGTGC GGTGGCCGCG GTCGCCCAGA TGGGCGTGCA GCGCGGCGGC
GTGGTCCTCG TGGCGCTCCT GGGCGGGCTG ACCGGCGCGG CGGTGTTCAC GGCGGCCACG
CGGGTCATGG TGGTCGGCCA GTTCGGCACG CAGGCGGTGC TGTACGCGGC CCAGCCCAGG
TTCGCCGAAC AGCTGGCGAC GGGCGACCAC GCCGGGGTCC GGGCCCTCTA CCAGGCGGGA
ACGGCGTGGC TGGTGTGCCT TCTGTGGCCT TTGTACCTGT CGGTTCTGGT GTTCGCCCCC
CAGGTGATGC GGCTGTTCGG GCCGGAGTAC GCGGCGGGGG CGACGGCGCT GGTCGTGGTG
TGTGCGGGCC AACTGACGGC CGCCGCGCTG GGGATGAGCG ACCTGGTCCT GACGATGACC
GGGCTGACCC GGCTCAACCT GGTCAACAAC GTGCTGTCGC TGGCGGCGAA CGTGCTGGTG
TGCGTGCTGC TCGTACCCGT GGCCGGAGCG ACCGGGGCGG CCGTGGCCCT GGTGGCCGCG
ATGCTGGTGC GCAAGCTGCT CCCGCTGTGG CAGTTGCGGT CCCACGTGGT GCTGCACCCG
TTCAGCCGCC CGGTGCTGGC CGCGACCGCC AGCGCGCTGA CGTGGTTCGG CGTCCTGCCG
CTGCTGCTGG AGGCACTGCT GGGCGGCGGG ATCGCGACGC TGGCGATGGC GGTGGCCTCG
GGAGCGGTGG GGCACCTGGT GACGGTGTGG TCGCTGCGCG GACTGCTGGG CCTGGACCCG
CGCCGGTGA
 
Protein sequence
MGARPLVARV TAAAERERAG LRRVLRGGAV NMAGAVVGAA LNLAVIVTIT RAFSQETAGL 
LFSATSVFLI AAVVANLGAS DGLVYFIARM RVFGEPGGVP RLLRTAAAPA VLAACALAVL
LVVCAGPVAR GLGGGEAEVY LRLLAVFLPF AVLADTALAA TRAHHDMAGT VLVDKVGRPL
AQLALVTGVA LSGAAGLLAL AWAGPYLPAA VVAWFWLGRV VRRAFPEAGG ASGDADGSGG
AHGSRGADAL GKTHGSGDAD ALGKAAPVGK RGTYGAPSAT GTAVDSEIGR AREESATTET
AVAPEAVEES VERVEARTFW AFSLPRAVAA VAQMGVQRGG VVLVALLGGL TGAAVFTAAT
RVMVVGQFGT QAVLYAAQPR FAEQLATGDH AGVRALYQAG TAWLVCLLWP LYLSVLVFAP
QVMRLFGPEY AAGATALVVV CAGQLTAAAL GMSDLVLTMT GLTRLNLVNN VLSLAANVLV
CVLLVPVAGA TGAAVALVAA MLVRKLLPLW QLRSHVVLHP FSRPVLAATA SALTWFGVLP
LLLEALLGGG IATLAMAVAS GAVGHLVTVW SLRGLLGLDP RR