Gene Ndas_2565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2565 
Symbol 
ID9246416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3058289 
End bp3059977 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content77% 
IMG OID 
Product2-succinyl-6-hydroxy-2,4-cyclohexadiene-1- carboxylic acid synthase/2-oxoglutarate decarboxylase 
Protein accessionYP_003680490 
Protein GI297561516 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.407494 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0898243 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCCGT CCACCGCCCT GGCGCGTGTC CTGGTCGACG AGCTGGCCCG CTGCGGCCTG 
GCCGAGGCCG TCGTCGCGCC GGGGTCGCGC TCGACGCCGC TCGCCCTGGC CCTGGTCGCC
CACCCCGGAA TCCGGGTGCA CGTGCGCATC GACGAGCGCT CGGCCTCCTT CCTCGCCCTC
GGTCTGGCCC GCGTCTCGCG CCGCCCGGTC GCGGTGGTGT GCACCTCCGG CACGGCCGCG
GCCAACTTCC ACCCGGCGGT GATGGAGGCG GACGAGAGCG GGGTGCCGCT GCTGGTGCTC
ACCGCCGACC GGCCCCCGGA GCTGCGCGGG ACCGGCGCCA ACCAGACGGT GGACCAGATC
GGCCTGTACG GCGGCGCGGT CCGCCTGTTC GCCGAGGTCG GCACCCCCGA CCCGGTGCCC
GGCATGGTCG CCTACTGGCG GTCGCTGGCC TGCCGCGCGT GGGGCTCGGC GCTGGGCGGA
CGCCCCGGTC CGGTCCACCT CAACGTGGCC TTCCGCGACC CGCTGACCCC CGACGCCGAC
ACGGGCGCGT GGCCCGAGCC CCTGGAGGGG CGCGACGGCG GCCGGGCGTG GATCGGCCGC
CCGGGGCCGC TGACCGAACC CGCGCCGTTC GCCCTGCCCG ACGTCGAGCG CGGTGTGATC
GTGTGCGGCG ACGGGGACTA CGACCCGGTG CCGTTCCTGG CGCTGGCCGA GGCGACCGGG
TGGCCCCTGC TGGCCGAACC GACCTCCAAC GCCCGCCGGG CGGGCGCGCT GTCCACCTAC
CGGCACTTGC TGGCCTCGCC GCGCGTGGTC GCGCGGCCGG CGCCGGAGCT GGTGGTGAGC
GTGGGCCGCC CCAACCTGTC CCGGCAGATC CTGGCCTACC TGCGCCGGGC CGAACGGCAC
GTGGTGGTCG GCGCGGGCGC GCTCGACGCC TTCTCCGACC CGGTGCGCAC CGCCACCGAC
GTGGTGGCCG CGGTCGCGCC GCCCGCCGGT CTGGACCCCG GCTCCCCGCG GAGCACCGAG
TGGTCGCGGA CCTGGTCGGA GGCGGAGGCG GTGGCCCGCG CCGCCCTGGA CGCGGTGCTG
GACGAGGAGG AGGTGCTCAG CGAGCCGCGG CTGGCCCGCG ACCTGGTCGC GCACATGTCC
ACCGGTTCGC TGCTGTTCGC CGGTTCGAGC ATGCCCATCC GCGACCTGGA CGCCACCATG
CGGGCGCGCT GCGGCGTGCG CCTGGTGGGC AACCGGGGCG TCAGCGGGAT CGACGGGTCG
GTCTCCACGG CCATCGGCGC GGCGCTGGCC CACCAGGCCG GGGGCGGGGG GCACGCCTAC
GCGTTGCTGG GCGACCTCGC GATGCTGCAC GACCAGAACG GGCTGCTGAT CGGCCCGGGG
GAGCCTCGCC CGGACCTGGC GATCGTCGTG GTCAACAACG ACGGCGGCGG GATCTTCTCC
GGTCTGGAGC AGGCCGGGCA CCCCGACTTC GAGCGGGTGT TCGGCACCCC GCACGGGGCG
TCGATGGAAC GGGTGGCGGC GGTGGCCGAC GTGCCCTACA CCCGCCTGGA GTGGGCGACC
GACCTGCCCA AGGCGCTGCT GGGCGAGGGG CCGCGCCTGA TCGAGGTGTG CACGCACCGC
GCGGGCAGCG CCGCGCTGCG CCGCCGGATC CAGGCCGCGG TGGACGCCGC GGTGGACGGG
GCGCTCTGA
 
Protein sequence
MNPSTALARV LVDELARCGL AEAVVAPGSR STPLALALVA HPGIRVHVRI DERSASFLAL 
GLARVSRRPV AVVCTSGTAA ANFHPAVMEA DESGVPLLVL TADRPPELRG TGANQTVDQI
GLYGGAVRLF AEVGTPDPVP GMVAYWRSLA CRAWGSALGG RPGPVHLNVA FRDPLTPDAD
TGAWPEPLEG RDGGRAWIGR PGPLTEPAPF ALPDVERGVI VCGDGDYDPV PFLALAEATG
WPLLAEPTSN ARRAGALSTY RHLLASPRVV ARPAPELVVS VGRPNLSRQI LAYLRRAERH
VVVGAGALDA FSDPVRTATD VVAAVAPPAG LDPGSPRSTE WSRTWSEAEA VARAALDAVL
DEEEVLSEPR LARDLVAHMS TGSLLFAGSS MPIRDLDATM RARCGVRLVG NRGVSGIDGS
VSTAIGAALA HQAGGGGHAY ALLGDLAMLH DQNGLLIGPG EPRPDLAIVV VNNDGGGIFS
GLEQAGHPDF ERVFGTPHGA SMERVAAVAD VPYTRLEWAT DLPKALLGEG PRLIEVCTHR
AGSAALRRRI QAAVDAAVDG AL