Gene Ndas_4117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4117 
Symbol 
ID9247991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4916757 
End bp4917944 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content73% 
IMG OID 
Productsuccinyl-CoA synthetase, beta subunit 
Protein accessionYP_003682018 
Protein GI297563044 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATCTGT ACGAGTACGA GGCCAAACAG CTCTTCGGGG AGTACGGCGT CCCCCTCGTC 
GAGGGCGAGA TCGCGGACAC CCCCGAACAG GCCCGGCTGG CGGCCGGACG GATCGGCCAC
CGGGTGGTGG TCAAGGCGCA GGTCAAGACC GGTGGTCGCG GCAAGGCCGG CGGCGTCAAG
GTCGCCGAGG GCCCCGAGGA CGCCGGGGCC AGGGCCGAGC AGATCCTCGG CATGGACATC
AAGGGCCACA CCGTCCGCCG CGTCCTCATC GAGGAGGCCT CCGACATCGC GGAGGAGTAC
TACTTCTCCT TCCTGCTGGA CCGCGCGAAC CGCACCTTCC TCTCGATCTG CTCCGCCGAG
GGCGGCATGG ACATCGAGGA GGTCGCCCGG ACCCGGCCCG AGGCGGTCGT GCGCACCCCG
GTCGGCCCCG GGGGCGTGGA CCACGGGGCC GCCCTCGCGA TCTGCCGCGC GGCCGGGCTG
CCCGAGGAGG TGCGCGACTG CGCGGCCCAG GTGGTCACCC GGCTCTGGCA GGTCGCCGTC
GGGGAGGACG CCACCCTCGT CGAGGTCAAC CCCCTGGTCC GCACGGCCGA CGGGCGGATC
ATCGCCCTGG ACGGCAAGGT CACCCTGGAC GGCAACGCCG CCTTCCGCCA CCCCGAGCGG
ACCCCGTTCG CCGACGGCGC CGACACCGAC GAGCGCGAGC GCATGGCCAG GGCCAGGGGC
CTGAACTACG TCAGGCTCGA CGGCGAGGTC GGCGTCATCG GCAACGGCGC GGGTCTGGTC
ATGTCCACCC TGGACGTGGT CGCCCACGCG GGCGGGGCGC ACGGCGGGGT GCGACCGGCC
AACTTCCTGG ACATCGGCGG CGGGGCCTCG GCCGAGGTCA TGGCCAACGG CCTGGAGATC
GTCCTGGGCG ACCCCTCGGT CAGGAGCGTC CTGGTCAACG TCTTCGGCGG CATCACCGCC
TGCGACGCGG TGGCCGAAGG CATCGTCCGG GCCCTGGACA TGCTGGAGGG CCGCAGCGGC
GACGAGGGCT TCGATCAGCT CGGCAAGCCG CTGGTCGTGC GCCTGGACGG CAACAACGCC
GAGCTGGGCC GCGAGATCCT CACCAAGCGG GCCCACCCGG CCGTGCAGCA GGTGGACACC
ATGGACGGCG CCGCCGCCCG GGCCGCCGAG CTCGCGGCCG CCAACTGA
 
Protein sequence
MDLYEYEAKQ LFGEYGVPLV EGEIADTPEQ ARLAAGRIGH RVVVKAQVKT GGRGKAGGVK 
VAEGPEDAGA RAEQILGMDI KGHTVRRVLI EEASDIAEEY YFSFLLDRAN RTFLSICSAE
GGMDIEEVAR TRPEAVVRTP VGPGGVDHGA ALAICRAAGL PEEVRDCAAQ VVTRLWQVAV
GEDATLVEVN PLVRTADGRI IALDGKVTLD GNAAFRHPER TPFADGADTD ERERMARARG
LNYVRLDGEV GVIGNGAGLV MSTLDVVAHA GGAHGGVRPA NFLDIGGGAS AEVMANGLEI
VLGDPSVRSV LVNVFGGITA CDAVAEGIVR ALDMLEGRSG DEGFDQLGKP LVVRLDGNNA
ELGREILTKR AHPAVQQVDT MDGAAARAAE LAAAN